SAFETY - AI Dynamics

Organizational AI adoption challenges and malicious threat mitigation

By

–

12 November 2022 0h18

Yeah, and don't get me wrong, this is very cool work! And I know you folks know these things. But
1. I think there's a lot of people who don't and just hope for a "silver bullet" and
2. Getting organizational buy-in seems tricky, particularly with malicious parties out there.

→ View original post on X — @thegautamkamath

12 November 2022

Open Source AI Models and Regulatory Frameworks Need Society Agreement

By

@thegautamkamath

–

12 November 2022 0h10

I'm also cautious about rogue model designers who don't play by the rules. At least for the time being, it seems like the community is very good at making open source alternatives to proprietary models. We'd need to agree as a society that certain things should be off limits.

→ View original post on X — @thegautamkamath

12 November 2022

Model Designers and Users Share AI Safety Responsibility

By

@thegautamkamath

–

12 November 2022 0h04

Definitely! With cooperation from model designers, there's hope. But like I said, if all the burden is on the users alone? Then I'm not so hopeful.

→ View original post on X — @thegautamkamath

12 November 2022

Rich People Power and Organizational Resistance Strategies

By

@clementdelangue

–

11 November 2022 23h46

Let's say no in case one of them is listening 😉 More seriously whether you're a for-profit, non-profit, public or private, the truth is that rich people can always mess with you. Hopefully they won't and we'll try to resist if they do!

→ View original post on X — @clementdelangue

11 November 2022

Image Sanitization Safety Against Advanced AI Models

By

@thegautamkamath

–

11 November 2022 20h12

Nice article by @KyleBarr5
. I gave my perspective (really, the perspective of Radiya-Dixit et al. https://
arxiv.org/abs/2106.14851): users sanitizing and releasing their own images seems hopeless. It may be safe today, but it probably won't be safe against a smarter model a year from now.

→ View original post on X — @thegautamkamath

11 November 2022

AI Impersonation Service Creates Identity Fraud Problems

By

@simonw

–

11 November 2022 17h58

Who could have possibly predicted that selling the ability to impersonate anyone for a few hours for $8 would result in problems with "impersonation issues"?

→ View original post on X — @simonw

11 November 2022

CDEIUK Launches Red-Teaming Recruitment for Privacy-Enhancing Technologies

By

@lawrennd

–

11 November 2022 9h08

The @CDEIUK has also launched it's recruitment process for red-teaming these projects. https://
ktn-uk.org/news/red-team-
registration-uk-privacy-enhancing-technologies-challenge-prize/
…

→ View original post on X — @lawrennd

11 November 2022

Future Fund crisis threatens NeurIPS ML Safety workshop grants

By

@thegautamkamath

–

11 November 2022 4h04

"…there are many committed grants that the Future Fund will be unable to honor." The #NeurIPS2022 ML Safety Workshop (
https://
neurips2022.mlsafety.org) had an eye-watering $100k of awards for best papers and AI risk analyses. Given all the turmoil, I presume these are getting the axe?

→ View original post on X — @thegautamkamath

11 November 2022

Increased proliferation risk estimate and fast takeoff concerns

By

@id_aa_carmack

–

11 November 2022 2h52

I have increased my estimate of the proliferation risk, which does indirectly increase the risk of fast takeoff, but my constant factor for the danger is still quite low.

→ View original post on X — @id_aa_carmack

11 November 2022