

Apparently “emotionally mature” means “blushing schoolgirl”
for a moment there i thought i’d been uninformed about the US threatening to annex California
Declaring black wins draws would be more in the spirit of how the game is actually played at a high level. I don’t think anyone seriously considers the possibility that black could have a forced win in chess from the starting position.
They’re probably talking about Ziz’s group. The double homicide in Pennsylvania is likely the murder of Jamie Zajko’s parents referenced in this LW post, and the Vallejo homicide is that of the landlord they had a fatal altercation with, who was killed recently.
That o3 does well on frontier math held-out set is impressive, no doubt
I think there is plenty of room for doubt still. elliotglazer on reddit writes:
Epoch’s lead mathematician here. Yes, OAI funded this and has the dataset, which allowed them to evaluate o3 in-house. We haven’t yet independently verified their 25% claim. To do so, we’re currently developing a hold-out dataset and will be able to test their model without them having any prior exposure to these problems.
My personal opinion is that OAI’s score is legit (i.e., they didn’t train on the dataset), and that they have no incentive to lie about internal benchmarking performances. However, we can’t vouch for them until our independent evaluation is complete.
(emphasis mine). So there is good reason to doubt that the “held-out dataset” even exists.
ok i watched Starship Troopers for the first time this year and i gotta say a whole lot of that movie is in fact hot people shooting bugs