Quality News

Which LLMs fold under pressure? We made 6 LLMs argue 300 hard cases to find out (servanda.ai)
×5.16 | 7 points by luke14free 3 hours ago | 1 comment

Story Stats

This chart shows the history of this story's rank on the Hacker News "Top" (Front) Page, "New" Page, and "Best" Page, along with its raw rank as computed by the Hacker News ranking formula.

This chart shows the history of this story's upvotes compared to the expected upvotes for stories shown at the same ranks and times.

This chart shows the history of this story's estimated true upvote rate: the predicted long-term ratio of upvotes to expected upvotes.