2Making AI chatbots friendly leads to mistakes and support of conspiracy theories (opens in new tab)(theguardian.com)93Cynddl11d ago80
3UK Biobank health data keeps ending up on GitHub (opens in new tab)(biobank.rocher.lc)197Cynddl17d ago57
4ChatGPT Edu feature reveals researchers' project metadata across universities (opens in new tab)(fastcompany.com)2Cynddl1mo ago0
5AI no better than other methods for patients seeking medical advice, study shows (opens in new tab)(reuters.com)3Cynddl3mo ago0
6AI chatbots pose 'dangerous' risk when giving medical advice, study suggests (opens in new tab)(bbc.co.uk)4Cynddl3mo ago2
7Show HN: Small, anonymous app for teams to do retrospective sessions (opens in new tab)(retrospective.rocher.lc)1Cynddl3mo ago0
8Measuring What Matters: Construct Validity in Large Language Model Benchmarks (opens in new tab)(arxiv.org)1Cynddl6mo ago0
9AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds (opens in new tab)(gizmodo.com)43Cynddl6mo ago17
10AI's capabilities may be exaggerated by flawed tests, according to new study (opens in new tab)(nbcnews.com)3Cynddl6mo ago0
11Experts find flaws in tests that check AI safety and effectiveness (opens in new tab)(theguardian.com)3Cynddl6mo ago0
12Measuring What Matters: Construct Validity in Large Language Model Benchmarks (opens in new tab)(oxrml.com)3Cynddl6mo ago2