January 2025

The Allen Institute for AI (AI2) Releases Tülu 3 405B: Scaling Open-Weight Post-Training with Reinforcement Learning from Verifiable Rewards (RLVR) to Surpass DeepSeek V3 and GPT-4o in Key Benchmarks

Post-training techniques, such as instruction tuning and reinforcement learning from human feedback, have become essential for refining language models. However, open-source approaches often fall behind proprietary models due to a…

ai

Memorization vs. Generalization: How Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) Shape Foundation Model Learning

Modern AI systems rely heavily on post-training techniques like supervised fine-tuning (SFT) and reinforcement learning (RL) to adapt foundation models for specific tasks. However, a critical question remains unresolved: do…

techcrunch

Anthropic CEO Dario Amodei is trying to duck a deposition in an OpenAI copyright lawsuit

Anthropic CEO Dario Amodei is trying to avoid being deposed in a copyright lawsuit against OpenAI, according to new court filings. In response, lawyers for the plaintiff — the Authors…

techcrunch

CFPB fines fintech Wise, alleging it charged deceptive fees

The Consumer Financial Protection Bureau (CFPB) has hit UK-based remittance company Wise with about a $2 million fine for what it described as “a series of illegal actions.” Those actions…

techcrunch

Apple will pay $20M to settle Watch battery swelling suit, ‘denies wrongdoing’

Apple has agreed to pay $20 million to resolve a class-action lawsuit over battery swelling on the Apple Watch. Filed in the U.S. District Court for the Northern District of…

wired business

OpenAI’s o3-Mini Is a Leaner AI Model That Keeps Pace With DeepSeek

On the heels of DeepSeek R1, the latest model from OpenAI promises more advanced capabilities at a cheaper price.

wired business

Here’s How DeepSeek Censorship Actually Works—and How to Get Around It

A WIRED investigation shows that the popular Chinese AI model is censored on both the application and training level.

ai

OpenAI releases its new o3-mini reasoning model free

On Thursday, Microsoft announced that it’s rolling OpenAI’s reasoning model o1 out to its Copilot users, and now OpenAI is releasing a new reasoning model, o3-mini, to people who use…

techcrunch

Meta turns to solar — again — in its data center-building boom

The announcement comes as Meta CEO Mark Zuckerberg maintains the company’s ambitious AI strategy, which will require hefty capital investments in data centers.

techcrunch

Stablecoins are finding product market fit in emerging markets

Five years ago, SpaceX launched Starlink, which has since grown into its biggest revenue driver, expanding to over 100 countries. But as Starlink scaled, it faced a major hurdle: accepting…

You missed

How Justin Ernest invested nearly $400M into hot startups without a traditional VC fund

GM joins race to build batteries for AI data centers and the grid

Hey Siri, here’s what I actually want from AI

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button