I’m curious what the consensus is here on which models people use for general-purpose work (coding assistance, general experimentation, etc.).

What do you consider the “best” model under ~30B parameters?

  • SmokeyDope@lemmy.worldM
    5 days ago

    I’m a big fan of NousResearch. Their DeepHermes release was awesome, and now I’m trying out Hermes 4. I have an 8 GB GTX 1070 Ti and was able to fully offload a medium quant of Hermes 4 14B with an okay amount of context.
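
For anyone wondering whether a given quant will fit on their own card, here is a rough back-of-the-envelope sketch in Python. It only adds up quantized weights, the KV cache, and a fixed overhead. Every number in the example call (bits per weight, layer count, KV heads, head dimension, context length, overhead) is an illustrative assumption, not a measured value for Hermes 4 14B or any particular quant, so treat the result as a ballpark rather than a guarantee.

```python
GIB = 2**30  # bytes per GiB


def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return n_params * bits_per_weight / 8 / GIB


def kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                 context_len: int, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GiB.

    One key and one value vector per layer, per KV head, per token;
    bytes_per_elem=2 assumes a 16-bit cache.
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem / GIB


def fits(vram_gib: float, n_params: float, bits_per_weight: float,
         n_layers: int, n_kv_heads: int, head_dim: int, context_len: int,
         overhead_gib: float = 0.5) -> tuple[bool, float]:
    """Return (fits, estimated_total_gib) for a full GPU offload.

    overhead_gib is a guessed allowance for compute buffers and the runtime.
    """
    total = (weight_gib(n_params, bits_per_weight)
             + kv_cache_gib(n_layers, n_kv_heads, head_dim, context_len)
             + overhead_gib)
    return total <= vram_gib, total


if __name__ == "__main__":
    # Hypothetical inputs: a 14B model at ~4 bits per weight with a
    # 40-layer, 8-KV-head, 128-dim architecture and 4096 tokens of context.
    ok, total = fits(8.0, 14e9, 4.0, 40, 8, 128, 4096)
    print(f"estimated {total:.2f} GiB, fits in 8 GiB: {ok}")
```

Swapping in the real bits-per-weight of your quant and the layer and head counts from the model's config gives a more useful estimate. Actual usage also depends on the runtime and on what else is using the GPU.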

    I’m a big fan of the hybrid reasoning models. I like being able to turn thinking on or off depending on the scenario.

    I had a vision-model document scanner + TTS setup going with a finetune of Qwen 2.5 VL and OuteTTS.

    If you care more about character emulation for writing and creativity, then Mistral 2407 and Mistral NeMo are other models to check out.