Reinforcement Learning Course

Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

Ai2 updates its Olmo 3 family of models to Olmo 3.1 following additional extended RL training to boost performance.

A look under the hood of DeepSeek’s AI models doesn’t provide all the answers

A peer-reviewed paper about Chinese startup DeepSeek's models explains their training approach but not how they work through ...

Baseten Acquires Parsed to Enable Companies to Own Their Intelligence

The acquisition adds world-class reinforcement learning and post-training expertise to deliver superior inference quality and performance for Baseten customers via specialized intelligence SAN ...

InfoWorld

3 ways to get into reinforcement learning

Whether you like theoretical study or want to get your hands dirty, plenty of reinforcement learning resources are out there. When I was in graduate school in the 1990s, one of my favorite classes was ...

AWS simplifies AI agent customization with automated reinforcement learning

A similar update is coming to Amazon SageMaker AI, which is a more advanced AI machine learning platform that allows ...
