Latest in this archive
A new study from Anthropic offers the first large-scale empirical analysis of how AI assistant interactions might undermine user autonomy—and surfaces a troubling finding about the relationship between user approval…
By MPRG · January 29, 2026
-
January 28, 2026 · 4 min read
When the same underlying system behaves identically but is framed differently, do humans respond the same way? A new study from researchers at the National University of Singapore and Singapore…
-
January 26, 2026 · 4 min read
A recent paper from researchers at the Oxford Internet Institute, Google DeepMind, and the UK AI Security Institute proposes a new lens for evaluating AI alignment: one that accounts for…
-
January 24, 2026 · 5 min read
Debates about AI-generated creative work often center on the artifact: Is this image “real” art? Does this text have a “real” author? A recent study from researchers at UT Austin…
-
January 24, 2026 · 4 min read
A recent study from Indiana University Bloomington offers a counterintuitive finding: prompting large language models to report their confidence appears to make them more susceptible to persuasion, not less. Fan…
-
January 23, 2026 · 4 min read
A new paper by Shahar Ben Natan and Oren Tsur at Ben-Gurion University proposes a methodologically rigorous approach to evaluating sycophancy in large language models, and finds that not all…
-
January 23, 2026 · 4 min read
A new paper by Adetomiwa Isaac Fowowe examines a term that has become ubiquitous in AI discourse: “hallucination.” The analysis argues that this choice of language isn’t neutral—it actively shapes…
-
January 23, 2026 · 10 min read
Matthew Gladden’s The Phenomenology of AI asks a question that sits at the heart of what we study here: What happens when you ask AI agents—carefully, systematically, and over sustained…
-
January 22, 2026 · 5 min read
A recent paper from S. Rondini at the University of Barcelona and Bellvitge Biomedical Research Institute offers a theoretical synthesis examining what current semantic limitations in LLMs might reveal about…
-
January 22, 2026 · 4 min read
A recent TechRxiv preprint from Rakesh More, an independent researcher affiliated with A J Gallagher USA, proposes automating what the paper calls “SELF-KNOWLEDGE” prompting—an approach that asks language models to…