OpenHermes 2.5 Mistral 7B beats DeepSeek 67B and Qwen 72B on AGIEval, as well as other 7B and 13B models!

Agent Issue
11 min read · Dec 3, 2023

Today, I’m zeroing in on OpenHermes, a model that’s been turning heads in the LLM community recently. It has become a go-to open LLM for many, and its derivatives keep topping the Open LLM Leaderboard.

Without much fluff, I’ll walk you through the local setup for:

  • OpenHermes 2.5 Mistral 7B (beats DeepSeek 67B and Qwen 72B on AGIEval)
  • OpenHermes-2.5-neural-chat-7b-v3-1-7B (#1 in the 7B and 13B categories)

and I will ask the following questions to compare their answers:

  • Language Understanding and Creativity: “How would you explain the concept of democracy to a 10-year-old?”
  • Problem-Solving and Logical Reasoning: “If a train travels at 60 miles per hour and has to cover a distance of 120 miles, how long will it take to reach its destination?”
  • General Knowledge and Fact Verification: “Can you provide a summary of the French Revolution?”
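To make the comparison concrete, here is a minimal sketch of how the three questions could be run against both models side by side. The names `PROMPTS`, `expected_travel_hours`, and `compare_models` are hypothetical helpers made up for illustration, and each model is assumed to be wrapped in a plain Python function that takes a prompt and returns text. The sketch also works out the reference answer for the train question (time = distance / speed), so there is something to check the models against.

```python
from typing import Callable, Dict

# The three comparison prompts, keyed by the category they are meant to probe.
PROMPTS: Dict[str, str] = {
    "Language Understanding and Creativity":
        "How would you explain the concept of democracy to a 10-year-old?",
    "Problem-Solving and Logical Reasoning":
        "If a train travels at 60 miles per hour and has to cover a distance "
        "of 120 miles, how long will it take to reach its destination?",
    "General Knowledge and Fact Verification":
        "Can you provide a summary of the French Revolution?",
}


def expected_travel_hours(distance_miles: float, speed_mph: float) -> float:
    """Reference answer for the train question: time = distance / speed."""
    return distance_miles / speed_mph


def compare_models(
    models: Dict[str, Callable[[str], str]],
) -> Dict[str, Dict[str, str]]:
    """Send every prompt to every model.

    `models` maps a display name to a hypothetical generate function
    (prompt in, answer out). Returns {category: {model_name: answer}}.
    """
    return {
        category: {name: generate(prompt) for name, generate in models.items()}
        for category, prompt in PROMPTS.items()
    }


if __name__ == "__main__":
    # Stand-in model so the sketch runs without any weights downloaded.
    def stub(prompt: str) -> str:
        return "(model answer to: " + prompt + ")"

    answers = compare_models({"model-a": stub, "model-b": stub})
    for category, per_model in answers.items():
        print(category)
        for name, answer in per_model.items():
            print("  " + name + ": " + answer)

    # 120 miles at 60 mph takes 2 hours.
    print(expected_travel_hours(120, 60))
```

Swapping the stub for real inference calls is all it takes to reuse the loop; how each model is actually loaded depends on the local setup.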

Let’s get going!

Understanding OpenHermes 2.5 Mistral 7B
