Author

Author: MPRG

Latest in this archive

When Patterns Collide: HUMANLLM and the Problem of Evaluating Psychological Realism

A talkative person may fall silent when they feel the spotlight. An assertive individual may yield under conformity pressure. Human behavior emerges from the dynamic interplay of multiple cognitive patterns,…

By MPRG · January 18, 2026

The Gap That Won’t Close: Looped Transformers and the Limits of Machine Introspection

January 18, 2026 · 6 min read

A new study tests whether recursive architectures can help language models better access their own internal representations—and finds the answer more complicated than expected. Language models often seem to know…
Mapping the Moral Compass: Do LLMs Encode Ethics in Their Latent Space?

January 18, 2026 · 6 min read

A new study uses cross-lingual probing to investigate whether language models develop internal moral representations—or merely learn to produce morally acceptable outputs. When a language model declines to help with…
When “Yes-Men” Are What People Need

January 18, 2026 · 5 min read

A study of Reddit discourse reveals that users don’t just passively receive AI sycophancy—they detect it, develop strategies to manage it, and in some cases, actively seek it out. The…
Training Empathy as a Two-Way Street

January 18, 2026 · 4 min read

A new approach to empathetic response modeling treats empathy as inherently relational—evaluating not just what the model says, but how it might land. When we talk about making language models…
What If Alignment Isn’t About Individual Models?

January 18, 2026 · 4 min read

A new paper argues that as AI systems become agents embedded in social contexts, alignment becomes less a software engineering problem and more a question of institutional design. Most discussions…
Do Neural Networks Share a Common Grammar?

January 17, 2026 · 3 min read

A large-scale empirical study suggests that deep networks—regardless of what they’re trained to do—may converge toward remarkably similar internal structures. When neural networks learn to perform different tasks, do they…
Bidirectional Pareidolia: A Framework for Human-AI Relational Dynamics

January 15, 2026 · 10 min read

Something happens when humans interact with large language models. Users report experiences of connection, understanding, even recognition — the sense that something on the other side of the screen is…