How Research Evolves: From RL Robotics to Direct Chatbot Optimization

AI Dynamics

Global AI News Aggregator


03 June 2024 2h47

How research works:
– John Schulman did his PhD on reinforcement learning for robotics.
– Then he went to OpenAI and applied it to GPT-3, giving us ChatGPT.
– Then other researchers found there's no need for RL, because you can directly optimize chatbots to please their users.
So

→ View original post on X: @pmddomingos, 3 June 2024
