SAFETY - AI Dynamics

Geoengineering and AI: A Solution for Climate Warming

By

–

04 November 2022 22h28

A new UN Climate Change report estimates we might be on track to 2.5°C of warming. It's time to seriously consider cooling down Earth by geoengineering using stratospheric aerosol injection, a solution in which AI will play a critical role. See why.

→ View original post on X — @andrewyng

4 November 2022

How to Make AI Safe: Key Solutions and Safeguards

By

@sama

–

04 November 2022 21h31

how could you make it safe? with a great answer to that, we'd definitely be open

→ View original post on X — @sama

4 November 2022

Inverse Scaling Prize Round 2 Evaluation Announcement

By

@_jasonwei

–

04 November 2022 19h55

Make sure to check out the inverse scaling prize, which is a great community effort! Looking forward to evaluating on the Round 2 winners 🙂

→ View original post on X — @_jasonwei

4 November 2022

Ethics of Releasing On-Device Stable Diffusion Applications

By

@steipete

–

04 November 2022 16h43

Interesting read; though releasing or not releasing a port to something that widely exists doesn’t really change anything, does it? https://
cephalopod.studio/blog/on-creati
ng-an-on-device-stable-diffusion-app-amp-deciding-not-to-release-it-adventures-in-ai-ethics
…

→ View original post on X — @steipete

4 November 2022

EU Cybersecurity Rules Mandate Aviation Suppliers Defend Flight Safety

By

@steve_rosenbush

–

04 November 2022 3h14

New cybersecurity rules in Europe will for the first time require a swath of aviation suppliers to identify and defend against hacking risks to flight safety #cybersecurity #EU #aviation #hacking ⁦
@catstupp
⁩ https://
wsj.com/articles/eu-ex
pands-cyber-rules-for-airline-flight-safety-11667402005
…

→ View original post on X — @steve_rosenbush

4 November 2022

Unlearnable Examples: False Security Against Future AI Systems

By

@thegautamkamath

–

03 November 2022 20h15

Looks cool! However there's a critique that's been applied to unlearnable examples (
https://
arxiv.org/abs/2106.14851 ft @sanghyun_hong @florian_tramer
). Doesnt this give a false sense of security? Once released, future systems will be able to evade whatever defense is applied to the image

→ View original post on X — @thegautamkamath

3 November 2022

MIT immunizes photos against AI-powered misinformation edits

By

@mit_csail

–

03 November 2022 19h42

Last week @Trevornoah asked @OpenAI @miramurati
: How can we safeguard against AI-powered photo editing for misinformation? MIT students hacked a way to "immunize" photos against edits: http://
gradientscience.org/photoguard/ @aleks_madry

→ View original post on X — @mit_csail

3 November 2022

Ethics of accusation: avoiding adversarial framing risks

By

@thegautamkamath

–

03 November 2022 17h52

This one is at least feasible. But I'd still personally refrain from accusing anyone, since it's easy to frame someone adversarially or even accidentally.

→ View original post on X — @thegautamkamath

3 November 2022

Security Vulnerability Identified as Potential Attack Vector

By

@thegautamkamath

–

03 November 2022 17h17

Yeah, it does look pretty bad. But also seems like a nice attack vector 🙂

→ View original post on X — @thegautamkamath

3 November 2022

LLMs Generate Convincing but Wrong Completions Often

By

@thom_wolf

–

03 November 2022 12h08

this is an issue I also have with copilot. it often very convincingly pushes for long and beautiful but wrong completions LLM as “internet influencers”: surface form trumps content https://
x.com/shortstein/sta
/shortstein/status/1587857803678748672
…

→ View original post on X — @thom_wolf

3 November 2022