How did EXO Labs get a lightweight Llama 2 running on a 1997 Pentium II with just 128 MB of RAM? By leaning on BitNet’s ternary-weight approach, in which every weight is -1, 0, or 1, the team showed the model could respond, albeit slowly. The result underscores that software optimization, not new silicon, can unlock surprising headroom on legacy machines. EXO Labs just