新鮮でリアルなコンテンツで自然に言語を学ぼう！

地域別に探す

AI スタートアップの Galileo Technologies は、Claude 3.5 Sonnet、Google の Gemini、Alibaba の Qwen2-72B-Instruct を Hallucination Index ベンチマークでトップにランク付けしました。 AI startup Galileo Technologies ranks Claude 3.5 Sonnet, Google's Gemini, and Alibaba's Qwen2-72B-Instruct top in the Hallucination Index benchmark.

flag AI スタートアップの Galileo Technologies は、新しいベンチマークテストである Hallucination Index で、中規模およびオープンソースの大規模言語モデルを高く評価しました。 flag AI startup Galileo Technologies has ranked midrange and open-source large language models highly in a new benchmark test, the Hallucination Index. flag このベンチマークでは、22 の主要な生成 AI モデルを評価し、3 つのタスクコレクションにわたってその精度を測定しました。 flag The benchmark, which evaluates 22 leading generative AI models, measured their accuracy across three task collections. flag Anthropic の Claude 3.5 Sonnet がランキングのトップとなり、Google の Gemini 1.5 Flash がコスト面で最高のパフォーマンスを発揮しました。 flag Anthropic's Claude 3.5 Sonnet topped the ranking, while Google's Gemini 1.5 Flash performed best on cost. flag Alibaba の Qwen2-72B-Instruct は、最高のパフォーマンスを発揮したオープンソースモデルでした。 flag Alibaba's Qwen2-72B-Instruct was the top-performing open-source model.

3 記事

記事

SiliconANGLE

SD Times

PYMNTS.com

-- 表示を減らす --

さらに読む

Sonnet

FLASH

Generative AI

Anthropic

人気のトピック

地域別に探す

記事

さらに読む

関連記事