Member-only story
OpenHermes 2.5 Mistral 7B beats Deepseek 67B and Qwen 72B on AGIEVal, and other 13B and 7B models!
11 min readDec 3, 2023
Today, I’m zeroing in on OpenHermes, a model that’s been turning heads in the LLM community recently. It became the go-to LLM, and its derivatives keep topping the Open LLM Leaderboard.
Join our next cohort: Full-stack GenAI SaaS Product in 4 weeks!
Without much fluff, I’ll walk you through the local setup for:
- OpenHermes 2.5 Mistral 7B (beats Deepseek 67B and Qwen 72B on AGIEVal)
- OpenHermes-2.5-neural-chat-7b-v3–1–7B (#1 in 7B and 13B category)
and I will ask the following questions to compare their answers:
- Language Understanding and Creativity: “How would you explain the concept of democracy to a 10-year-old?”
- Problem-Solving and Logical Reasoning: “If a train travels at 60 miles per hour and has to cover a distance of 120 miles, how long will it take to reach its destination?”
- General Knowledge and Fact Verification: “Can you provide a summary of the French Revolution?”
Let’s get going!