1. ChatGPT Edu feature reveals researchers' project metadata across universities (fastcompany.com) | 2 points by Cynddl 13d ago | 0 comments
2. AI no better than other methods for patients seeking medical advice, study shows (reuters.com) | 3 points by Cynddl 1mo ago | 0 comments
3. AI chatbots pose 'dangerous' risk when giving medical advice, study suggests (bbc.co.uk) | 4 points by Cynddl 1mo ago | 2 comments
4. Show HN: Small, anonymous app for teams to do retrospective sessions (retrospective.rocher.lc) | 1 point by Cynddl 1mo ago | 0 comments
5. Measuring What Matters: Construct Validity in Large Language Model Benchmarks (arxiv.org) | 1 point by Cynddl 4mo ago | 0 comments
6. AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds (gizmodo.com) | 43 points by Cynddl 4mo ago | 17 comments
7. AI's capabilities may be exaggerated by flawed tests, according to new study (nbcnews.com) | 3 points by Cynddl 4mo ago | 0 comments
8. Experts find flaws in tests that check AI safety and effectiveness (theguardian.com) | 3 points by Cynddl 4mo ago | 0 comments
9. Measuring What Matters: Construct Validity in Large Language Model Benchmarks (oxrml.com) | 3 points by Cynddl 4mo ago | 2 comments
11. Facial recognition works better in the lab than on the street, researchers show (theregister.com) | 4 points by Cynddl 7mo ago | 1 comment
12. We Shouldn't Trust Facial Recognition's Glowing Test Scores (techpolicy.press) | 2 points by Cynddl 7mo ago | 0 comments
13. Training language models to be warm and empathetic makes them less reliable (arxiv.org) | 358 points by Cynddl 7mo ago | 375 comments
14. AI's limited understanding of gender puts health equity at risk (oii.ox.ac.uk) | 4 points by Cynddl 10mo ago | 0 comments
15. Establishing meaningful data access for algorithm audits (syntheticsociety.oii.ox.ac.uk) | 1 point by Cynddl 1y ago | 0 comments