NTT Scientists Present Breakthrough Research on AI Deep Learning at ICLR 2025
News Highlights:
-
Nine papers presented at esteemed international conference by
NTT Research and NTT R&D scientists on breakthroughs in the branch of AI called “deep learning.” -
Five papers co-authored by members of NTT Research’s new Physics of
Artificial Intelligence (PAI) Group explore fundamental elements of AI learning, understanding and growth. -
The PAI Group , established inApril 2025 , aims to deepen understanding of AI mechanisms, observe the learning and prediction behaviors of AI and heal the breach of trust between AI and human operators.
Collectively, this research breaks new ground in understanding how AI models learn, grow and overcome uncertainty—all supporting NTT’s commitment to pioneering transformative, socially resilient, sustainable and responsible AI.
“The Physics of
One paper, “Forking Paths in Neural Text Generation,” addresses the issue of estimating uncertainty in Large Language Models (LLMs) for proper evaluation and user safety. Whereas prior approaches to uncertainty estimation focus on the final answer in generated text—ignoring potentially impactful intermediate steps—this research tested the hypothesis of the existence of key forking tokens, such that re-sampling the system at those specific tokens, but not others, leads to very different outcomes. The researchers discovered many examples of forking tokens, including punctuation marks, suggesting that LLMs are often just a single token away from generating a different output.
The paper was co-authored by
Four other papers co-authored by members of the
-
“In-Context Learning of Representations:” Researchers explore the open-ended nature of LLMs (for example, their ability to in-context learn) and whether models alter these pretraining semantics to adopt alternative, context-specific ones. Findings indicate that scaling context size can flexibly re-organize model representations, possibly unlocking novel capabilities. Authors include:
Core Francisco Park 3,5,6,Andrew Lee 7, Ekdeep Singh Lubana3,5,Yongyi Yang 3,5,8,Maya Okawa 3,5,Kento Nishi 5,7,Martin Wattenberg 7 andHidenori Tanaka . -
“Competition Dynamics Shape Algorithmic Phases of In-Context Learning:” Researchers propose a synthetic sequence modeling task that involves learning to simulate a finite mixture of Markov chains. They argue that In-Context Learning (ICL) is best thought of as a mixture of different algorithms, each with its own peculiarities, instead of a monolithic capability, also implying that making general claims about ICL that hold universally across all settings may be infeasible. Authors include:
Core Francisco Park , Ekdeep Singh Lubana,Itamar Pres 9 andHidenori Tanaka . -
“Dynamics of Concept Learning and Compositional Generalization:” Researchers propose an abstraction of prior work's compositional generalization problem by introducing a structured identity mapping (SIM) task, where a model is trained to learn the identity mapping on a Gaussian mixture with structurally organized centroids. Overall, the work establishes the SIM task as a meaningful theoretical abstraction of concept learning dynamics in modern generative models. Authors include:
Yongyi Yang ,Core Francisco Park , Ekdeep Singh Lubana,Maya Okawa ,Wei Hu 8 andHidenori Tanaka . -
“A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language:” Recognizing the need to establish the causal factors underlying the phenomenon of "emergence" in a neural network, researchers seek inspiration from the study of emergent properties in other fields and propose a phenomenological definition for the concept in the context of neural networks. Authors include: Ekdeep Singh Lubana,
Kyogo Kawaguchi 10,11,12,Robert P. Dick 9 andHidenori Tanaka .
In addition, four papers authored or co-authored by NTT R&D scientists based in
-
“Test-time Adaptation for Regression by Subspace Alignment” Authors include:
Kazuki Adachi 13,14, Shin’ya Yamaguchi13,15,Atsutoshi Kumagai 13 andTomoki Hamagami 14 . -
“Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods” Authors include:
Akira Ito 16,Masanori Yamada 16 andAtsutoshi Kumagai . -
“Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation” Authors include:
Hiroshi Takahashi 13,Tomoharu Iwata 13,Atsutoshi Kumagai ,Yuuki Yamanaka 13 andTomoya Yamashita 13. -
“Wavelet-based Positional Representation for Long Context” Authors include:
Yui Oka 13,Taku Hasegawa 13,Kyosuke Nishida 13,Kuniko Saito 13.
ICLR 2025, the thirteenth
The NTT Research Physics of
Formally established in
_________________________ |
|
1
|
2
|
3
|
4
|
5CBS-NTT Program in Physics of Intelligence, |
6
|
7SEAS, |
8CSE, |
9
|
10Nonequilibrium Physics of Living Matter RIKEN Hakubi Research Team, |
11RIKEN Cluster for |
12
|
13
|
14
|
15
|
16
|
17
|
About
View source version on businesswire.com: https://www.businesswire.com/news/home/20250424778713/en/
NTT Research Contact:
Chief Marketing Officer
+1-312-888-5412
chris.shaw@ntt-research.com
Media Contact:
For
+1-804-500-6660
ngibiser@wireside.com
Source: