If you can run 13B, I would recommend the following models. The links are for the GGML models. Keep in mind that if you can run a 30B or larger model, there are other LLMs that will work better.
This one is my personal current favorite. I think that it’s better than Chronos for short messages. For me it usually sticks to using asterisks for actions, and quotes for speech, which is what I prefer.
This one feels more “clever” to me all around, and is currently very popular for roleplay. It produces results that are usually longer, so I feel like its better suited for longer dialogue. It also feels to me like it understands the scenarios better, and I usually get slightly more creative results from it.
If you can run 13B, I would recommend the following models. The links are for the GGML models. Keep in mind that if you can run a 30B or larger model, there are other LLMs that will work better.
TheBloke/manticore-13b-chat-pyg-GGML
This one is my personal current favorite. I think that it’s better than Chronos for short messages. For me it usually sticks to using asterisks for actions, and quotes for speech, which is what I prefer.
TheBloke/chronos-wizardlm-uc-scot-st-13B-GGML
This one feels more “clever” to me all around, and is currently very popular for roleplay. It produces results that are usually longer, so I feel like its better suited for longer dialogue. It also feels to me like it understands the scenarios better, and I usually get slightly more creative results from it.