BIG-Bench Benchmarks Face Validity Scrutiny for AGI Predictions - AI Dynamics

Skip to content

AI Dynamics

Global AI News Aggregator

Rechercher

BIG-Bench Benchmarks Face Validity Scrutiny for AGI Predictions

By

–

12 November 2022 21h29

Trouble in BIG-Bench paradise? – @ErnestSDavis looks at 48 of the benchmarks within and finds problems with most: https://
cs.nyu.edu/~davise/Benchm
arks/BigBenchDiscussion.html
… – Many project AGI timelines based on performance on these benchmarks. If the benchmarks aren’t valid, consequent timelines are problematic

→ View original post on X — @garymarcus

12 November 2022

AGI AI MARKET TRENDS RESEARCH

←Algorithm discovers specific symmetries from defined possibility space

Inductive bias guides learner function search in neural networks→

MORE ARTICLES

Paper praised for executing Gato idea with humanoid; more work desired

28 June 2026
Skild Brain AI enables robots to handle unfamiliar environments

28 June 2026
Proposal to replace Google Search with Gemini

28 June 2026
Using video to learn control representations, touch important

28 June 2026

INNOVATION GENERATIVE AI RESEARCH LLMS TOOLS MACHINE LEARNING CODE MARKET TRENDS TECHNOLOGY BUSINESS BIG TECH ETHICS ENTERPRISE AI SOFTWARE AGENTS AUTOMATION APPS COMPUTING DATA POLICY OPEN SOURCE MULTIMODAL AI REGULATION CULTURE CREATIVE AI PROMPT ENGINEERING SOCIETY ECONOMY SAFETY EDUCATION INVESTMENT AI HARDWARE AGI HARDWARE JOBS STARTUPS INDUSTRY ROBOTICS WORKFORCE SECURITY CYBERSECURITY HEALTHCARE AI SYSTEMS SUSTAINABILITY WEB3 DECENTRALIZED AI

AI Dynamics

Global AI News Aggregator

About
Archives
Contact

Rechercher