Binary
FundamentalsA compiled, executable file that a computer can run directly, as opposed to source code that must be interpreted or compiled first.
A binary is a file containing machine code or bytecode that a computer can execute directly. In software development, the term refers to the compiled output of source code - the ready-to-run program that results from translating human-readable code into instructions a processor can understand.
In the context of AI and machine learning, binaries are important when deploying models to production. ML frameworks like TensorFlow and PyTorch compile computational graphs into optimized binaries for inference. Tools like ONNX Runtime, TensorRT, and llama.cpp produce binaries tuned for specific hardware, enabling faster model execution on CPUs, GPUs, or specialized accelerators.
The distinction between source code and binaries matters for distribution and deployment. Pre-built binaries allow users to run software without needing compilers or development toolchains, which simplifies the process of setting up inference servers, running local LLMs, and deploying AI applications to edge devices.
Related Terms
Last updated: February 25, 2026