@whats_ai - AI Dynamics - Page 5 of 131

The discrepency is always junior reaction "wow it's perfect", while more senior is "why is it doing it like this", and hopefully its a senior's reaction with openness rather than just thinking its useless.

→ View original post on X — @whats_ai,

26 April 2026

Embedding Leaderboard Performance Fails on Real Data

By

@whats_ai

–

26 April 2026 17h00

Embedding leaderboard wins evaporate the second you swap to your actual corpus haha

→ View original post on X — @whats_ai,

26 April 2026

AI Model Evaluation Benchmarks Limited Testing Scope Critique

By

@whats_ai

–

26 April 2026 16h57

Same energy as "we only tested on MMLU" a year ago haha

→ View original post on X — @whats_ai,

26 April 2026

ChatGPT Hallucinations: How Often Does It Fabricate Facts?

By

@whats_ai

–

26 April 2026 14h10

Have you ever caught ChatGPT making up a fact you knew was wrong?

→ View original post on X — @whats_ai,

26 April 2026

Grounding: Why Perplexity Cites Sources, ChatGPT Sometimes Doesn’t

By

@whats_ai

–

26 April 2026 14h10

Grounding is why Perplexity always cites a source, and ChatGPT sometimes doesn't.

→ View original post on X — @whats_ai,

26 April 2026

Grounding: How Document Upload Improves AI Accuracy

By

@whats_ai

–

26 April 2026 14h00

Ever uploaded a document to ChatGPT and asked a question about it? The answer you got came from grounding. When you ask a model a question without any file, it answers from memory.
Whatever it learned during training. Sometimes right, sometimes made up. Grounding forces the

→ View original post on X — @whats_ai,

26 April 2026