Edu Expertise Hub

    Latest Alibaba AI model demos AI improvements

    By Team | March 8, 2025 | 3 Mins Read

    Just two months after the tech world was upended by the DeepSeek-R1 AI model, Alibaba Cloud has introduced QwQ-32B, an open-source large language model (LLM).

    The Chinese cloud giant describes the new model as “a compact reasoning model” that uses only 32 billion parameters yet delivers performance comparable to large language models with far more parameters.

    On its website, Alibaba Cloud published performance benchmarks that suggest the new model is comparable to AI models from DeepSeek and OpenAI. These benchmarks include AIME 24 (mathematical reasoning), LiveCodeBench (coding proficiency), LiveBench (a benchmark designed to limit test-set contamination and allow objective evaluation), IFEval (instruction-following ability), and BFCL (tool and function-calling capabilities).

    Through continued scaling of reinforcement learning (RL), Alibaba claims the QwQ-32B model demonstrates significant improvements in mathematical reasoning and coding proficiency.

    In a blog post, the company said QwQ-32B, which uses 32 billion parameters, achieves performance comparable to DeepSeek-R1, which uses 671 billion parameters. Alibaba said that this shows the effectiveness of RL when applied to robust foundation models pretrained on extensive world knowledge.

    “We have integrated agent-related capabilities into the reasoning model, enabling it to think critically while utilising tools and adapting its reasoning based on environmental feedback,” Alibaba said in the blog post. 

    Alibaba said QwQ-32B demonstrates the effectiveness of using reinforcement learning (RL) to enhance reasoning capabilities. With this approach to AI training, an agent perceives and interprets its environment, takes actions and learns through trial and error. Reinforcement learning is one of several approaches developers use to train machine learning systems; Alibaba used it to get comparable performance out of a model with fewer parameters.
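
    The trial-and-error loop described above can be illustrated with a toy example. The sketch below is a generic epsilon-greedy "bandit" agent, not Alibaba's training method: the function name `run_bandit`, the three reward probabilities and the step count are all assumptions chosen for illustration. The agent does not know which option pays off most often and has to find out by acting and observing rewards.

```python
import random

def run_bandit(true_probs, steps=5000, epsilon=0.1, seed=0):
    """Toy trial-and-error learner (epsilon-greedy); all values are illustrative."""
    rng = random.Random(seed)           # fixed seed so the run is repeatable
    counts = [0] * len(true_probs)      # how often each action was tried
    values = [0.0] * len(true_probs)    # running estimate of each action's reward
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action.
            arm = rng.randrange(len(true_probs))
        else:
            # Exploit: pick the action that currently looks best.
            arm = max(range(len(true_probs)), key=lambda a: values[a])
        # The environment returns a reward of 1 or 0.
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental average: nudge the estimate towards the observed reward.
        values[arm] += (reward - values[arm]) / counts[arm]
    return values, counts

values, counts = run_bandit([0.2, 0.5, 0.8])
best_arm = max(range(len(values)), key=lambda a: values[a])
print("estimated rewards:", [round(v, 2) for v in values])
print("action the agent prefers:", best_arm)
```

    Training a language model with RL is far more involved, but the principle is the same: behaviour that earns a higher reward is reinforced.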

    “We have not only witnessed the immense potential of scaled RL, but also recognised the untapped possibilities within pretrained language models,” Alibaba said. “As we work towards developing the next generation of Qwen, we are confident that combining stronger foundation models with RL powered by scaled computational resources will propel us closer to achieving Artificial General Intelligence [AGI].”

    Alibaba said it is actively exploring the integration of agents with RL to enable what it describes as “long-horizon reasoning”, which it expects will eventually lead to greater intelligence through inference-time scaling.

    The QwQ-32B model was trained using rewards from a general reward model and rule-based verifiers, enhancing its general capabilities. According to Alibaba, these include better instruction-following, alignment with human preferences and improved agent performance.
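
    To give a sense of what a rule-based verifier can look like, here is a minimal sketch that scores a maths answer by comparing the last number in a model's response with a known reference. It is an assumption-laden illustration rather than Alibaba's implementation: the helper names `extract_final_answer` and `math_verifier_reward`, and the rule of taking the last number as the final answer, are hypothetical.

```python
import re

def extract_final_answer(response: str):
    """Return the last number found in the response text, or None.

    Treating the last number as the final answer is an illustrative rule.
    """
    matches = re.findall(r"-?\d+(?:\.\d+)?", response)
    return matches[-1] if matches else None

def math_verifier_reward(response: str, reference: str) -> float:
    """Return 1.0 if the extracted answer equals the reference, else 0.0."""
    answer = extract_final_answer(response)
    if answer is None:
        return 0.0  # no checkable answer, so no reward
    return 1.0 if float(answer) == float(reference) else 0.0

correct = math_verifier_reward("12 * 12 = 144. The answer is 144.", "144")
wrong = math_verifier_reward("The answer is 145.", "144")
missing = math_verifier_reward("I am not sure.", "144")
print(correct, wrong, missing)
```

    A verifier of this kind needs no learned judgement: the reward comes from a fixed rule, which makes it hard for the model to earn credit for a wrong answer.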

    China’s DeepSeek, which has been generally available since the start of the year, demonstrates the effectiveness of RL by delivering benchmark results comparable to those of rival US large language models. Its R1 LLM can rival US artificial intelligence without needing the latest GPU hardware.

    It is no coincidence that Alibaba’s QwQ-32B model also uses RL. The US has banned the export of high-end AI accelerator chips, such as the Nvidia H100 graphics processor, to China, which means Chinese AI developers have had to find alternative ways to make their models competitive. On published benchmarks, RL does appear to deliver results comparable to those of models such as OpenAI’s.

    What is interesting about the QwQ-32B model is that it uses significantly fewer parameters than DeepSeek-R1 to achieve similar results, which effectively means that it should be able to run on less powerful AI acceleration hardware.
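
    A rough back-of-the-envelope calculation shows why the parameter count matters for hardware. The sketch below estimates only the memory needed to hold the model weights, assuming 2 bytes per parameter (16-bit precision). That figure is an assumption, as is the helper name `weight_memory_gb`; real deployments also need memory for activations and caches, and may use other precisions.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate memory for the weights alone, in gigabytes (1 GB = 1e9 bytes).

    bytes_per_param=2.0 assumes 16-bit weights; this is an illustrative choice.
    """
    return num_params * bytes_per_param / 1e9

qwq_gb = weight_memory_gb(32e9)    # QwQ-32B: 32 billion parameters
r1_gb = weight_memory_gb(671e9)    # DeepSeek-R1: 671 billion parameters
ratio = 671e9 / 32e9               # how many times more parameters R1 has

print(f"QwQ-32B weights: about {qwq_gb:.0f} GB")
print(f"DeepSeek-R1 weights: about {r1_gb:.0f} GB")
print(f"Parameter ratio: about {ratio:.0f}x")
```

    Under these assumptions the smaller model needs roughly one-twentieth of the memory for its weights.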

    This post is exclusively published on eduexpertisehub.com
