Search results for "large"

Found 17 results (9 tools · 8 articles). Sorted by relevance by .

AI Tools (9)

LiblibAI: China's largest AI image generation platform and model community of 25 million creators

IBlibAI is an AI image generation platform established in March 2023, focusing on the creation and sharing of original AI painting models. The platform provides one-stop creative tools such as online image generation, model download and training, video generation, special effects templates, AI

FlagEval: The internationally authoritative large model evaluation system and Libra open platform

FlagEval (Libra) is a large-scale model evaluation system and open platform initiated by Beijing Zhiyuan Artificial Intelligence Research Institute, aiming to establish scientific, fair, and open evaluation benchmarks and methods. The platform has innovatively constructed a three-dimensional

AGI Eval: A large model evaluation community and authoritative third-party evaluation platform

AGI Eval is a large model evaluation community jointly created by top universities and institutions such as Shanghai Jiao Tong University, Tongji University, East China Normal University, and DataWhale, with the mission of "assisting evaluation and making AI a better partner for humanity". The

CMMLU: Authoritative Chinese Language Model Knowledge Understanding Ability Evaluation Benchmark

CMMLU (Chinese Massive Multitask Language Understanding) is a large-scale language understanding benchmark designed specifically for the Chinese language context, covering 67 subject topics from beginner to advanced professional levels, including natural sciences, social sciences, engineering

HELM: Stanford University led comprehensive evaluation framework for large language models and high

HELM (Holistic Evaluation of Language Models) is a comprehensive language model evaluation framework initiated by the Stanford University Center for Fundamental Model Research (CRFM), aimed at systematically evaluating large language models through multidimensional, standardized, and reproducible

MMLU: The International Authoritative Benchmark for Multi Task Language Understanding Ability of

MMLU (Massive Multitask Language Understanding) is a large-scale multi task language understanding evaluation dataset jointly released by the University of California, Berkeley and other institutions. It covers 57 disciplinary fields, including humanities, social sciences, natural sciences,

A Quick Take: Just how much hassle can that AI meeting note-taker hidden inside Baidu Netdisk really save you?

"Jiandan Tingji" is an AI-powered speech-to-text tool launched by Baidu Netdisk. Integrated with the ERNIE Bot (Wenxin Yiyan) large language model, it supports the transcription of meetings, interviews, and lectures, as well as the intelligent generation of meeting minutes, achieving an accuracy rate of up to 97%. It is available across all platforms, with a continuous monthly subscription priced at 25 yuan.

OpenCompass: An Open Source Large Model Comprehensive Evaluation System and Sinan Open Platform

OpenCompass is an open-source large model evaluation system launched by Shanghai Artificial Intelligence Laboratory, providing one-stop evaluation services for large language models, multimodal models, and scientific intelligence models. The platform supports one click distributed evaluation of

LangGPT - Structured Prompt Word Design Framework: Natural Language Programming for Large Language

LangGPT is a structured prompt word design framework inspired by programming languages, aimed at solving the problem of non AI experts having difficulty writing high-quality prompt words. Drawing on the design ideas of object-oriented programming languages, LangGPT proposed a dual layer structure

AI News (8)

OpenAI's GPT-5.6 series preview remains in internal testing, with proprietary inference chips set for mass production by year-end.

OpenAI is officially advancing the limited beta testing of its GPT-5.6 model series and has announced a partnership with Broadcom to co-develop "Jalapeño," a dedicated inference ASIC. The project targets tape-out within nine months and large-scale deployment by the end of 2026, aiming to slash inference costs by 50% and effectively completing the company's vertically integrated strategy combining models and chips.

Anthropic's valuation has reached $965 billion, surpassing OpenAI for the first time.

The AI industry is undergoing a seismic shift as Anthropic’s post-investment valuation climbs to $965 billion, surpassing OpenAI for the first time. Its Claude Opus 4.8 model has reached the pinnacle of reasoning capabilities, while Claude Code captures over half the code-focused AI market with $6.3 billion in annual revenue; this analysis delves into the company's three core strengths: capital, technology, and commercialization.

How do you view Minimax's new M3 multimodal model and updated Token Plan?

I originally wanted to say 'M3 is very strong' or 'just like that' directly, but later I found out that it's not something that can be explained in a single sentence. Let me admit one thing first: I have a bias against Minimax. It's not the kind of malice, it's the prejudice of being 'tricked'.

Sun Zhengyi was the richest man in Asia again, and fell again three days later: how to understand

To be honest, my first reaction when I saw this news was not "wow", but "is it again?" - because the last time he was so successful was in 2017. After that, everyone knows the story: WeWork went bankrupt, SoftBank Vision Fund suffered huge losses, his personal wealth once shrank to $21.1 billion,

The wave of mandatory retirement is coming! Codex launches GPT-5.2/5.3-Code in June, developers

The independent variable robot releases the world's first "event level prediction" embodied

Breaking the limitations of traditional frame by frame learning, the way robots understand tasks has entered a new stage

Out of control and overturned! Google AI unauthorized tampering with production code, resulting in

Meituan Open-Sources LongCat-2.0, a 1.6-Trillion-Parameter MoE Model; Full-Stack Execution on Domestic Computing Hardware Successfully Achieved

Meituan has officially open-sourced LongCat-2.0, a massive MoE model with 1.6 trillion parameters. This release achieves end-to-end training and inference deployment on a domestic computing cluster comprising 50,000 GPUs, thereby breaking reliance on overseas computing power. The announcement details LongCat-2.0's technical highlights, performance advantages, and industry value.