Thursday, November 21, 2024

Chinese DeepSeek-R1 AI Model With Advanced Reasoning Capabilities Released, Can Rival OpenAI o1

Date:

Share post:


A Chinese artificial intelligence (AI) model was released on Wednesday which claims to take on OpenAI’s o1 AI model in terms of advanced reasoning. Dubbed DeepSeek-R1-Lite-Preview, the large language model (LLM) is said to have outperformed the o1 model on several benchmarks. Notably, the AI model is available to test on the web for free, although its advanced reasoning feature can only be used a select number of times. Additionally, the AI model also offers a transparent thought process which users can see to gauge how the output decision was made.

DeepSeek-R1 AI Model Unveiled

Advanced reasoning is a relatively new capability in LLMs which allows them to make decisions with multi-step thought processes. There are several advantages to this. For one, such AI models can answer more complex queries and require an understanding of deeper context and expert-level knowledge of the topic. Another, such AI models can also fact-check themselves minimising the risk of hallucination.

However, so far, not many foundation models are capable of advanced reasoning. While some mixture-of-agent (MoE) models can do this, they are built of multiple smaller models. In the mainstream space, OpenAI o1 series models are known for this capability.

But, on Wednesday, DeepSeek, a Chinese AI firm, posted on X (formerly known as Twitter) announcing the release of the DeepSeek-R1-Lite-Preview model. The company claims it can outperform the o1-preview model on the AIME and MATH benchmarks. Notably, both of these test the mathematical and reasoning abilities of an LLM.

Gadgets 360 staff members were able to access the chatbot and found that the AI model also shows the entire chain of thought after submitting a query. This allows users to understand the logical connection being made by the model, and spot any shortcomings. In our testing, we found the AI model capable of answering complex questions.

The response time was also short, making the conversation flow efficient. At present, users only get 50 messages to try out the “Deep Think” mode which shows the model’s thought process. Additionally, currently, this is the only free-to-use AI model with advanced reasoning. Interested individuals can try out the AI chatbot on the web here.

Notably, the company has claimed that it will open-source the full version of the DeepSeek-R1 AI model in the near future, which would be a first for an LLM of this class.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who’sThat360 on Instagram and YouTube.


Samsung’s Black Friday Sale: Discounts on Galaxy Watch Ultra, Galaxy Watch 7, Galaxy Buds 3 Series, More




LEAVE A REPLY

Please enter your comment!
Please enter your name here

spot_img

Related articles

US, India Extend Digital Tax Truce to June 30 as Deadline Approaches

The United States and India have extended a standstill agreement on U.S. retaliation over India's digital-services tax...

Inputs from support staff vital: Kapil Dev

Is the India-Australia series more significant than the Ashes?It is an important series. There is no need...

Samsung Galaxy S24 Ultra 5G vs Apple iPhone 16 Pro Max: Which is Better?

Samsung and Apple are currently the two brands that have dominated the ultra-premiums segment in India. Samsung...

MSI Claw A1M Review: Late Entry into Handheld Gaming

Technology might seem like it's constantly moving forward on a curve, but it's often cyclical. Flip phones,...