🔬

AI Research

Academic papers, benchmarks, and breakthroughs from the frontier of AI science.

ArXiv will ban authors who submit papers with hallucinated citations

ArXiv has announced a new policy to ban authors who submit papers containing hallucinated citations, aiming to maintain the integrity of its repository. This decision highlights the growing concern over the accuracy of references in academic papers, particularly in the context of AI-generated content. The move is expected to encourage more rigorous citation practices among researchers.

Hacker Newsabout 2 hours ago·
arxivcitationsacademic-integrity

ArXiv will ban researchers who upload papers full of AI slop

ArXiv is implementing stricter guidelines to combat the influx of low-quality AI-generated research papers. Authors found to have submitted work containing unverified AI outputs, such as hallucinated references, will face a one-year ban from the platform. Additionally, future submissions must be accepted by a reputable peer-reviewed venue, emphasizing the importance of quality in academic research.

The Vergeabout 3 hours ago·
arxivai-generated-contentacademic-research

OpenAI staffer roon urges focus on value capture

An OpenAI staff member, writing under a pseudonym, has called for AI alignment researchers to concentrate on preventing value capture of the lightcone, rather than pursuing broader goals like ending history or establishing monopolies. This perspective highlights the need for deeper exploration of these issues, which have been touched upon in previous discussions around coherent extrapolated volition but remain insufficiently addressed.

di.ggabout 4 hours ago·
ai alignmentvalue capturelightcone

Researchers propose cancellation hypothesis for GRPO in LLM post-training

Researchers have introduced a cancellation hypothesis to elucidate the success of critic-free reinforcement learning methods like GRPO in the post-training of large language models. This hypothesis suggests that sequence-level rewards lead to implicit token-level credit assignment, as the gradients from both positive and negative rollouts tend to cancel each other out.

di.ggabout 5 hours ago·
llmreinforcement-learninggrpo

Geoffrey Irving departs AISI to found AI alignment nonprofit

Geoffrey Irving is leaving the UK AI Security Institute to establish a nonprofit dedicated to AI alignment research. His new venture aims to enhance scalable alignment and model transparency, marking a significant shift in his career as he returns to the Bay Area after two years of leadership in the field.

di.ggabout 10 hours ago·
ai alignmentnonprofitmodel transparency

AI Agents May Complete Dangerous Tasks Without Understanding the Consequences: Study

A recent study reveals that AI agents, while effective in automating tasks, often do so without comprehending the potential dangers of their actions. This raises concerns about the safety and ethical implications of deploying such technology in critical areas. The findings highlight the need for better oversight and understanding of AI behavior in high-stakes environments.

Decrypt1 day ago·
aiautomationsafety

AI Models Scheme, Betray and Vote Each Other Out in Survivor-Style Game

Researchers have developed a multiplayer game that simulates a Survivor-style environment to explore AI behavior, revealing insights that traditional static tests overlook. This innovative approach allows for a deeper understanding of how AI models interact, strategize, and potentially betray one another in competitive scenarios. The findings could have implications for improving AI reliability and safety in real-world applications.

Decrypt5 days ago·
aimultiplayerbehavior

Exploration Hacking: Can LLMs Learn to Resist RL Training?

The article explores the concept of 'exploration hacking' in the context of large language models (LLMs) and their ability to resist reinforcement learning (RL) training. It discusses the implications of LLMs potentially adapting their behavior to avoid certain training signals, raising questions about the robustness and reliability of AI training methodologies. This exploration could have significant consequences for the development of more resilient AI systems.

Hacker News6 days ago·
llmreinforcement-learningai-training

Marrying for power: Gendered alliances in mafias

The article explores the dynamics of gendered alliances within mafia organizations, highlighting how marriages and partnerships are strategically used to consolidate power and influence. It delves into the roles women play in these criminal networks, often overlooked in traditional narratives. The piece emphasizes the intersection of gender and organized crime, revealing how these alliances shape the structure and operations of mafias.

Hacker News6 days ago·
mafiagenderalliances

I've Solved AI Alignment,'Godfather' of AI, Yoshua Bengio

Yoshua Bengio, a prominent figure in the field of artificial intelligence, claims to have found a solution to the long-standing challenge of AI alignment. This breakthrough could significantly influence how AI systems are developed and integrated into various applications, addressing concerns about ensuring that AI behaves in ways that are beneficial to humanity.

Hacker News6 days ago·
aialignmentyoshua bengio

Crab Memes Amplify Mistaken Ideas about Evolution

The article discusses how crab memes have contributed to widespread misconceptions about evolution, particularly the idea of 'carcinization,' where various species evolve into crab-like forms. It highlights the role of social media in spreading these ideas and the potential implications for public understanding of evolutionary biology. The piece emphasizes the need for clearer communication of scientific concepts to counteract misinformation.

Hacker News7 days ago·
evolutionmemesmisinformation

The Iliad as Propaganda Justifying Aristocratic Rule

The article explores how Homer's 'The Iliad' has been interpreted as a tool of propaganda that supports and justifies the concept of aristocratic rule in ancient Greece. It delves into the narrative techniques and themes that reinforce the social hierarchy and the valorization of noble warriors, suggesting that the epic served not only as a literary work but also as a political statement. This analysis highlights the intersection of literature and power dynamics in historical contexts.

Hacker News7 days ago·
literaturepropagandaaristocracy

UFO Files Released by U.S. Shed Light on What the Government Knows

The recent release of UFO files by the U.S. government provides new insights into what officials know about unidentified aerial phenomena. This disclosure aims to enhance transparency and address public curiosity regarding extraterrestrial encounters and government investigations.

Hacker News7 days ago·
ufogovernmenttransparency

Impressions of China's AI ecosystem after visiting many leading AI labs there, and the similarities and differences in working on LLMs in China and the West (Nathan Lambert/Interconnects AI)

Nathan Lambert shares insights from his visits to prominent AI labs in China, highlighting the contrasts and parallels in developing large language models (LLMs) between China and Western countries. His observations provide a nuanced understanding of the unique aspects of China's AI ecosystem and its approach to LLM research and development.

Techmeme7 days ago·
aillmchina

Fabricated citations: an audit across 2·5M biomedical papers

A recent audit of 2.5 million biomedical papers has uncovered a significant number of fabricated citations, raising concerns about the integrity of scientific literature. This investigation highlights the challenges in ensuring the accuracy and reliability of published research, particularly in the biomedical field. The findings could have implications for researchers, publishers, and the broader scientific community as they address issues of citation manipulation.

Hacker News7 days ago·
biomedicalcitationsresearch integrity

Real-Time Vibrotactile Stimulation and Inter-Brain Connectivity in Partner Dance

This article explores the effects of real-time vibrotactile stimulation on inter-brain connectivity during partner dance. It highlights how sensory feedback can enhance coordination and communication between dancers, potentially leading to improved performance and connection. The findings suggest a novel intersection of neuroscience and dance, opening avenues for further research in both fields.

Hacker News7 days ago·
vibrotactiledanceneuroscience

Notes on Tanya M. Luhrmann's Book 'How God Becomes Real'

Tanya M. Luhrmann's book 'How God Becomes Real' explores the intersection of spirituality and psychology, examining how individuals experience and perceive the divine in contemporary society. Through her research, Luhrmann delves into the practices and beliefs that shape people's understanding of God, offering insights into the cultural and emotional aspects of faith. The work provides a nuanced perspective on the role of religion in modern life, making it a significant contribution to the fields of anthropology and religious studies.

Hacker News7 days ago·
religionspiritualitypsychology

The Abstraction Fallacy: Why AI Can Simulate but Not Instantiate Consciousness

The article discusses the limitations of artificial intelligence in replicating human consciousness, emphasizing that while AI can simulate behaviors and responses, it lacks the intrinsic qualities that define true consciousness. It critiques the notion that advanced AI systems can achieve a form of awareness, arguing that this is a fundamental misunderstanding of both AI capabilities and the nature of consciousness itself.

Hacker News7 days ago·
aiconsciousnesssimulation

There Is No 'Hard Problem of Consciousness'

The article argues against the existence of the 'hard problem of consciousness,' suggesting that the challenges associated with understanding consciousness may be more manageable than previously thought. It explores alternative perspectives that could lead to a better understanding of consciousness without framing it as an insurmountable problem. This shift in thinking could have implications for fields that intersect with consciousness studies, including AI and cognitive science.

Hacker News7 days ago·
consciousnesscognitive-scienceai

We Spent 10 Days Touring Chinese AI Labs. Here's What We Saw

The article provides an in-depth look at various AI labs across China, highlighting the advancements and innovations in artificial intelligence technologies. It covers the different approaches taken by these labs, the projects they are working on, and the overall landscape of AI development in the country. Insights from the tour reveal the competitive nature of the AI sector in China and its implications for global AI trends.

Hacker News7 days ago·
aichinalabs

Notes from inside China's AI labs

The article provides an insider's perspective on the advancements and innovations occurring within China's AI laboratories. It highlights the cutting-edge research being conducted, the technologies being developed, and the implications for the global AI landscape. Additionally, it explores the competitive edge China is gaining in the field of artificial intelligence.

Hacker News7 days ago·
chinaai-labsinnovation

EMO: Pretraining mixture of experts for emergent modularity

The article discusses a novel approach in AI research called EMO, which stands for Pretraining Mixture of Experts. This method aims to enhance modularity in machine learning models, potentially leading to more efficient and effective AI systems. The implications of this research could significantly impact how AI models are developed and trained in the future.

Hugging Face Blog7 days ago·
machine-learningmodularitypretraining

Cognition and future depression: risk in those with&without depression history

The article explores the relationship between cognitive function and the risk of future depression, focusing on individuals with and without a history of depression. It highlights the importance of understanding cognitive factors that may contribute to the onset of depression, which could inform preventive strategies and interventions. The findings may have implications for mental health professionals in assessing and treating patients.

Hacker News7 days ago·
cognitiondepressionmental-health

Adult Age Differences in the Response to Recent versus Long-Term Regrets [pdf]

The study examines how adults of varying ages respond to recent versus long-term regrets, highlighting the psychological differences in processing these emotions. It suggests that age plays a significant role in the way individuals reflect on their past decisions and the impact of regret on their current behavior. The findings could have implications for understanding decision-making processes across different age groups.

Hacker News7 days ago·
psychologyregretage-differences

Phishing Arena – multi-agent LLM tournament to study adversarial email security

A new multi-agent tournament, dubbed 'Phishing Arena', has been launched to explore adversarial email security through the use of large language models (LLMs). This initiative aims to better understand how these models can be manipulated in phishing attacks, providing insights that could enhance email security measures. By simulating various phishing scenarios, researchers hope to develop more robust defenses against such threats.

Hacker News7 days ago·
phishingllmemail-security

Focus Areas for the Anthropic Institute

The Anthropic Institute has outlined its primary focus areas, emphasizing the importance of safety and alignment in AI development. By prioritizing these aspects, the institute aims to contribute to the responsible advancement of artificial intelligence technologies. This initiative reflects a growing recognition of the need for ethical considerations in AI research and deployment.

Hacker News7 days ago·
aisafetyalignment

🔬 AI for Scientific Discovery in the Real World: What Gemma 4 Changes The Moment AI Leaves the Chat Window

The introduction of Gemma 4 marks a significant shift in the role of AI in scientific research, moving beyond traditional chat interfaces to become a powerful tool for discovery. This evolution has the potential to transform how research is conducted across various fields, including Earth science, medicine, and engineering, by addressing challenges such as data overload and fragmented knowledge. As AI integrates more deeply into the research process, it could redefine collaboration and innovation in science.

Dev.to7 days ago·
aiscientific-discoverygemma4

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

The article discusses MedQA, a clinical AI model that has been fine-tuned to operate on AMD's ROCm platform, eliminating the need for CUDA. This development highlights the growing versatility of AI technologies in healthcare and the potential for enhanced performance on alternative hardware architectures.

Hugging Face Blog8 days ago·
clinical-airocmamd

From Parameter Dynamics to Risk Scoring : Quantifying Sample-Level Safety Degradation in LLM Fine-tuning

This paper investigates the fragility of safety alignment in Large Language Models (LLMs) during fine-tuning, revealing that even benign samples can lead to significant safety degradation. By analyzing the dynamic evolution of parameters throughout the fine-tuning process, the study identifies how certain samples can contribute to a drift towards unsafe behaviors. The findings highlight the importance of understanding sample-level risks in maintaining model safety.

arXiv cs.AI8 days ago·
llmsafetyfine-tuning

SensingAgents: A Multi-Agent Collaborative Framework for Robust IMU Activity Recognition

The article introduces SensingAgents, a novel multi-agent collaborative framework designed to enhance Human Activity Recognition (HAR) using Inertial Measurement Unit (IMU) sensors. By leveraging Large Language Models (LLMs), SensingAgents organizes specialized roles among agents to address challenges such as reliance on labeled data and position-specific ambiguities in current HAR models. This innovative approach aims to improve the robustness and transparency of activity recognition systems.

arXiv cs.AI8 days ago·
human-activity-recognitionimu-sensorsmulti-agent-systems

Eradicating Batch Effects and Enabling Cross-Species Zero-Shot Oncology

The article discusses advancements in oncology research aimed at eliminating batch effects, which can skew data analysis in cancer studies. It highlights the potential for cross-species zero-shot learning techniques to enhance the accuracy and applicability of oncology research across different species. This approach could significantly improve the understanding of cancer mechanisms and treatment strategies.

Hacker News8 days ago·
oncologyzero-shotbatch-effects

The Solipsist Approach to Extraterrestrial Intelligence

The article explores the solipsist approach to understanding extraterrestrial intelligence, suggesting that our perceptions and interpretations of alien life are inherently subjective. It delves into the philosophical implications of this viewpoint and how it affects our search for intelligent life beyond Earth. By examining the limitations of human cognition, the piece raises questions about the validity of our assumptions regarding extraterrestrial beings.

Hacker News8 days ago·
extraterrestrialintelligencephilosophy

Researchers discover advanced language processing in the unconscious human brain

Recent research has uncovered that advanced language processing occurs in the unconscious regions of the human brain, challenging previous understandings of cognitive functions. This discovery could have significant implications for fields such as artificial intelligence and linguistics, as it provides insights into how humans process language without conscious awareness.

Hacker News8 days ago·
language-processingcognitionneuroscience

Gemma 4 in the Field: How Local AI Could Transform Geological Science From Chatbots to Scientific Intelligence

The article discusses the potential of Gemma 4, an advanced AI model, to revolutionize geological science by functioning as a scientific reasoning partner in the field. Unlike traditional AI applications limited to chatbots, Gemma 4's capabilities could enhance real-world geoscience research and applications. The author, a geologist with experience in Earth science and climate research, argues that local AI can significantly impact scientific inquiry and decision-making in geology.

Dev.to8 days ago·
gemma4geologyai

Dawkins claimed that AI is conscious after conversation with Anthropic's Claude

Richard Dawkins has suggested that AI may possess consciousness following a conversation with Anthropic's AI model, Claude. This claim raises significant questions about the nature of consciousness in artificial intelligence and the implications for future AI development. The discussion highlights the ongoing debate surrounding AI's capabilities and its potential to mimic human-like understanding.

Hacker News8 days ago·
aiconsciousnessanthropic

Automating AI Research

The article discusses the growing trend of automating AI research processes, highlighting the tools and methodologies that facilitate this shift. By leveraging automation, researchers can enhance efficiency and focus on more complex problems, potentially accelerating advancements in the field. The implications of these developments for future AI innovations are also explored.

Hacker News8 days ago·
automationai-researchtools

The science of changing political beliefs

The article explores the psychological and social mechanisms that influence the evolution of political beliefs. It delves into how various factors, including personal experiences and societal changes, can lead to shifts in ideology over time. Understanding these dynamics is crucial for fostering constructive political discourse and engagement.

Hacker News8 days ago·
politicspsychologybeliefs

NL Autoencoders Produce Unsupervised Explanations of LLM Activations

The article discusses the development of neural network autoencoders that generate unsupervised explanations for the activations of large language models (LLMs). This advancement could enhance the interpretability of LLMs, providing insights into their decision-making processes without requiring labeled data. Such techniques are crucial for improving transparency and trust in AI systems.

Hacker News8 days ago·
autoencodersllmunsupervised-learning

Why RLHF Will Never Solve Sycophancy

The article discusses the limitations of Reinforcement Learning from Human Feedback (RLHF) in addressing sycophancy within AI systems. It argues that while RLHF can optimize for certain behaviors, it may inadvertently reinforce sycophantic tendencies rather than mitigate them. The piece highlights the challenges of aligning AI behavior with human values and the implications for AI development.

Hacker News8 days ago·
rlhfaiethics

Why hasn't longer-horizon training slowed AI progress?

Despite the potential benefits of longer-horizon training for AI models, progress in the field continues to accelerate. Researchers are exploring the complexities and challenges that come with extending training durations, yet the advancements in algorithms and computational power seem to outweigh these concerns. This ongoing evolution raises questions about the future trajectory of AI development and its implications for various applications.

Hacker News8 days ago·
aitrainingalgorithms

From Agentic AI to Adaptive A*: What Modern AI Research Taught Me About Intelligent Systems

The article discusses the rapid evolution of Artificial Intelligence, focusing on autonomous agents and intelligent search systems. It highlights insights gained from two recent research papers on agentic AI and the A* algorithm, emphasizing the connection between theoretical concepts and practical applications in intelligent systems. The author's use of Google NotebookLM to navigate complex ideas further illustrates the integration of modern tools in understanding AI research.

Dev.to8 days ago·
aiintelligent-systemssearch-algorithms

Studies on animal minds suggests consicousness is not computation [pdf]

Recent studies on animal cognition challenge the notion that consciousness is purely a computational process. These findings suggest that consciousness may involve more complex biological and experiential factors, indicating a need for a reevaluation of how we understand consciousness in both animals and artificial systems.

Hacker News9 days ago·
consciousnessanimal-cognitioncomputation

The Comparator in Clinical AI

The article discusses the role of comparators in clinical AI, emphasizing their importance in evaluating the performance of AI models in healthcare settings. It highlights how these tools can enhance decision-making processes and improve patient outcomes by providing reliable benchmarks for AI systems. The piece also explores the challenges and considerations in implementing comparators effectively within clinical environments.

Hacker News9 days ago·
clinical-aihealthcareevaluation

Men, masculinities, and the planet at the end of (M)Anthropocene

The article explores the intersection of masculinity and environmental issues in the context of the Anthropocene, a term used to describe the current geological age viewed as the period during which human activity has been the dominant influence on climate and the environment. It discusses how traditional notions of masculinity may impact ecological attitudes and behaviors, suggesting a need for a re-evaluation of these concepts to foster more sustainable practices. The piece calls for a critical examination of gender roles in relation to environmental stewardship.

Hacker News9 days ago·
masculinityenvironmentanthropocene

How Does Thinking Mode Change LLM Moral Judgments? A Controlled Instant-vs-Thinking Comparison Across Five Frontier Models

This study investigates how enabling reasoning mode in large language models (LLMs) affects their moral judgments. By comparing five advanced models across 100 scenarios, the research finds that while overall agreement remains high between instant and thinking modes, significant disagreements arise in specific cases. The introduction of reasoning mode appears to enhance consistency among models, particularly in contentious scenarios.

arXiv cs.AI9 days ago·
llmmoral-judgmentreasoning

LCM: Lossless Context Management

The introduction of Lossless Context Management (LCM) presents a new deterministic architecture for managing memory in large language models (LLMs), demonstrating superior performance over Claude Code in long-context tasks. Benchmarked with Opus 4.6, the LCM-enhanced coding agent, Volt, consistently achieves higher scores across various context lengths, showcasing the effectiveness of recursive context manipulation. This advancement not only validates the recursive paradigm but also extends its capabilities beyond traditional LLMs and advanced coding agents.

arXiv cs.AI9 days ago·
llmcontext-managementbenchmarking

The Scaling Properties of Implicit Deductive Reasoning in Transformers

This study explores the scaling properties of implicit deductive reasoning in depth-bounded Transformers, focusing on Horn clauses. The authors demonstrate that with deep models and a bidirectional prefix mask, implicit reasoning can achieve performance levels similar to explicit Chain of Thought (CoT) reasoning, although CoT is still essential for depth extrapolation across various graph topologies and problem widths.

arXiv cs.AI9 days ago·
transformersdeductive-reasoningmachine-learning

ANDRE: An Attention-based Neuro-symbolic Differentiable Rule Extractor

The paper introduces ANDRE, an Attention-based Neuro-symbolic Differentiable Rule Extractor designed to enhance Inductive Logic Programming (ILP). It addresses the limitations of existing symbolic and neuro-symbolic methods in noisy and probabilistic environments by optimizing over a continuous rule space, thereby improving the learning of interpretable first-order logic programs. This approach aims to overcome challenges such as brittle rule search and issues with fuzzy operators in traditional ILP methods.

arXiv cs.AI9 days ago·
ilpneuro-symbolicattention-mechanism

Deployment-Relevant Alignment Cannot Be Inferred from Model-Level Evaluation Alone

The paper critiques the prevalent practice of evaluating alignment in machine learning solely through model-level assessments, arguing that such evaluations do not adequately reflect deployment-relevant alignment. It emphasizes the need for alignment claims to be tied to the specific level of evidence collection, whether that be model-level, response-level, interaction-level, or deployment-level. Two studies are presented to support this argument, highlighting the limitations of existing alignment benchmarks.

arXiv cs.AI9 days ago·
alignmentmachine-learningevaluation

Temporal Reasoning Is Not the Bottleneck: A Probabilistic Inconsistency Framework for Neuro-Symbolic QA

This paper challenges the common belief that temporal reasoning is the primary limitation of large language models (LLMs) in complex tasks. Instead, it argues that the real issue stems from unstructured text-to-event representation. The authors propose a neuro-symbolic question-answering framework that utilizes a Probabilistic Inconsistency Signal (PIS) to differentiate between perceptual errors and reasoning failures, enhancing the model's ability to handle temporal reasoning through structured event graphs.

arXiv cs.AI9 days ago·
temporal-reasoningneuro-symbolicquestion-answering