The Latest AI Innovations: GPT-4o Mini, Mistral NeMo, DeepSeek V2, and SmolLM

Major AI players unveiled new models this week, enhancing accessibility and capabilities, with innovations from OpenAI, Mistral AI, and Hugging Face.

Jesse Anglen
July 24, 2024

This week, OpenAI, Mistral AI, NVIDIA, DeepSeek, and Hugging Face all unveiled new models. The releases promise to make AI more powerful, affordable, and accessible, and they show how quickly both model capabilities and training techniques are advancing across the industry.


OpenAI has launched GPT-4o Mini, a cost-effective and highly capable model designed to replace GPT-3.5 Turbo. Priced at $0.15 per million input tokens and $0.60 per million output tokens, GPT-4o Mini offers improved intelligence and a 128k context window, making it accessible to a broader audience. The release has generated excitement due to its potential to democratize access to advanced AI capabilities, though some users have reported limitations in handling large code edits efficiently.
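To make the pricing concrete, here is a minimal sketch of the arithmetic using only the two per-million-token prices quoted above. The function name `estimate_cost` and the example workload are illustrative assumptions, not part of any OpenAI SDK, and the 128k window is treated as 128,000 tokens for simplicity.

```python
# Prices quoted in the article, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 0.15
OUTPUT_PRICE_PER_MILLION = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in US dollars for the given token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: a prompt that fills the context window
    # (assumed here to be 128,000 tokens) and a 2,000-token reply.
    print(f"${estimate_cost(128_000, 2_000):.4f}")
```

Under these assumptions, even a request that fills the whole context window costs roughly two cents, which is the sense in which the model is described as accessible.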


Mistral AI, in collaboration with NVIDIA, unveiled Mistral NeMo, a 12B-parameter model with a 128k-token context window. The model promises state-of-the-art reasoning, world knowledge, and coding accuracy for its size, and it is released under the Apache 2.0 license to encourage broad adoption. While its capabilities are impressive, some users are skeptical of the published benchmark comparisons against models such as Meta's Llama 8B, which has sparked heated debate among AI engineers.


DeepSeek V2 has significantly reduced inference costs, sparking a price war among Chinese AI companies. The cost-cutting innovations behind the model have earned DeepSeek the nickname China's "AI Pinduoduo" and could disrupt the global AI landscape.


SmolLM, released by Hugging Face, is a series of small language models in three sizes: 135M, 360M, and 1.7B parameters. The models are trained on Cosmo-Corpus, which comprises Cosmopedia v2 (28B tokens of synthetic educational content), Python-Edu (4B tokens of Python programming examples), and FineWeb-Edu (220B tokens of deduplicated web data). The SmolLM models have performed well on common sense reasoning and world knowledge benchmarks, positioning them as strong contenders in their size category.


Mistral AI's Mathstral model, developed in collaboration with Project Numina, is fine-tuned for STEM reasoning and scores well on the MATH and MMLU benchmarks. Mathstral 7B obtains 56.6% pass@1 on MATH, outperforming the far larger Minerva 540B by more than 20 percentage points. The model exemplifies the growing trend of specialized models optimized for specific domains, which could reshape AI applications in scientific and technical fields.
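For readers unfamiliar with the metric, pass@1 is the probability that a single sampled answer to a problem is correct. The sketch below uses the standard unbiased pass@k estimator from code-generation evaluation; the article does not say how Mathstral was evaluated, so the function names and the toy counts are assumptions for illustration only.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of them correct."""
    if n - c < k:
        # Every size-k subset of the samples contains a correct answer.
        return 1.0
    # 1 minus the probability that k samples drawn without replacement
    # are all incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, given (n, c) pairs per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


if __name__ == "__main__":
    # Hypothetical results for three problems: (samples drawn, samples correct).
    toy_results = [(10, 3), (10, 0), (10, 10)]
    print(benchmark_pass_at_k(toy_results, k=1))
```

With k = 1 the estimator reduces to the fraction of correct samples per problem, so a benchmark-level pass@1 of 56.6% means that a single attempt is expected to solve a little over half of the problems.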


Codestral Mamba, a new model from Mistral AI designed with help from Albert Gu and Tri Dao, offers linear-time inference and can, in theory, handle sequences of unbounded length. The model aims to enhance coding productivity, competing with state-of-the-art transformer-based models while providing rapid responses regardless of input length. The release has generated excitement about its potential impact on LLM architectures, although some users note that it is not yet supported in popular frameworks such as llama.cpp.
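Linear-time inference comes from processing tokens one at a time with a fixed-size state, rather than attending over every previous token. The sketch below is a toy scalar linear recurrence, not Mamba's actual selective state-space layer; the names `scan`, `a`, `b`, and `c` are illustrative assumptions.

```python
from typing import Iterable, Iterator


def scan(xs: Iterable[float], a: float, b: float, c: float) -> Iterator[float]:
    """Yield one output per input using a constant-size hidden state.

    Recurrence (a simplified, non-selective state-space model):
        h_t = a * h_{t-1} + b * x_t
        y_t = c * h_t
    """
    h = 0.0  # the entire memory of the past lives in this one number
    for x in xs:
        h = a * h + b * x  # constant work per token
        yield c * h


if __name__ == "__main__":
    # An impulse followed by zeros: the state decays geometrically.
    print(list(scan([1.0, 0.0, 0.0], a=0.5, b=1.0, c=2.0)))
```

Because each step touches only the current input and the stored state, total work grows linearly with sequence length and memory stays constant, which is why such models can respond quickly however long the input is.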


TextGrad introduces a framework for differentiation via textual feedback, opening new avenues for optimizing compound AI systems beyond conventional methods. Researchers have described TextGrad as a paradigm shift because it allows multiple large language models (LLMs) to be orchestrated for better performance. Separately, the STORM system reports a 25% improvement in article organization by simulating diverse perspectives, enabling LLMs to generate grounded, structured long-form content akin to Wikipedia entries.


This article highlights the rapid advancements in AI technology and the potential impact of these new models on various industries. Stay tuned for more updates on the latest AI innovations! For more insights, visit our Rapid Innovation Blogs.

