Notes on LLMs
2024-06-12
LLMs are distributed as huge files that contain compressed "weights". Weight is a probability of signal going from one node in the network to another.
LLMs need tools to run them. Most tools have some kind of llama
reference in their name.
llamafile
makes standalone LLM executables. It packs LLM model together with llama.cpp
tool in one file, which you can download and run right away.