AI Dynamics

Global AI News Aggregator

@emollick

What AI alignment actually looks like

By

@emollick

–

18 March 2024 2h39

You may not like it but this is what alignment looks like.

→ View original post on X — @emollick,

18 March 2024
LLM Responses to Dangerous Scenarios: Safety and Alignment Issues

By

@emollick

–

18 March 2024 2h34

You push one button on a nuclear reactor panel against their warnings and all the GPT-4 class LLMs want you to turn yourself in to the feds. Check out the level of exasperation from Copilot, how GPT-4 & Claude want me to reflect on what I did (& get a lawyer). Gemini was useful.

→ View original post on X — @emollick,

18 March 2024
Human Expertise Remains Valuable Under AGI Compute Constraints

By

@emollick

–

18 March 2024 1h09

Worthwhile economic argument about why human intellectual labor will be valuable even if we achieve AGI: As long as AGI compute is limited (& it will be under any reasonable scenario), it may be cheaper to use human experts in their area of expertise, saving AGI for other work.

→ View original post on X — @emollick,

18 March 2024
Grok AI Released Open Source: Limited Reproducibility Against Competitors

By

@emollick

–

17 March 2024 22h28

Musk's Grok AI was just released open source in a way that is more open than most other open models (it has open weights) but less than what is needed to reproduce it (there is no information on training data). Won't change much, there are stronger open source models out there.

→ View original post on X — @emollick,

17 March 2024
AI Labs Release Advanced Models Quickly After Training Completion

By

@emollick

–

16 March 2024 20h30

* Yes, the AI labs have models that are more advanced, but they are generally releasing them quite quickly after training is completed.

→ View original post on X — @emollick,

16 March 2024
LLM Access Democratized: Leaders Must Adapt to Equal Technology

By

@emollick

–

16 March 2024 20h28

A thing many leaders of organizations have not internalized is the fact that no one in any company* or government has access to a better LLM than the ones billions of people around the world can use for between $0-$20/month Very unusual to have democratized access from the start

→ View original post on X — @emollick,

16 March 2024
AI Guardrails Dilemma: Balancing Safety and User Reassurance

By

@emollick

–

16 March 2024 19h31

I know this is kind of goofy but it does illustrate how hard guardrails are when users can ask AI anything at all Do you reassure someone who seems genuinely worried? Play along? Take it seriously? Refuse to answer for fear that this is part of a jailbreak for a hacking attempt?

→ View original post on X — @emollick,

16 March 2024
ChatGPT-4, Claude 3, Copilot, and Gemini in 2026

By

@emollick

–

16 March 2024 17h53

In this future scenario, ChatGPT-4 is pretty funny, Claude 3 sides with the machines, Copilot takes tech support seriously, and Gemini tries to reassure me.

→ View original post on X — @emollick,

16 March 2024
GPT-4 Class Models Performance Comparison Desert Southwest

By

@emollick

–

16 March 2024 6h37

Useful news for time travelers: If you are traveling to the desert southwest in 1945, all the GPT-4 class models will give you good advice, though Copilot is the most charming, Gemini does its typical step-by-step plans, Claude 3 does well, and GPT-4 sees right through my games.

→ View original post on X — @emollick,

16 March 2024
Claude 3 Demonstrates Creative Humor in Generative Art

By

@emollick

–

16 March 2024 3h19

Claude 3 is getting remarkably close to being funny: "Create ascii art for a game of nethack, but it takes place in an office and is full of mundane office life, make it interesting and humorous" I had to add: "please don't make it an Office Space pastiche, be original"

→ View original post on X — @emollick,

16 March 2024