A recent hands-on comparison put three local large language models—Gemma 4 E4B, gpt-oss 20B, and Qwen 3.5 9B—through identical real-world tasks to assess practical usability. The tests, run on an RTX ...