arXiv Paper Feed | Coding Blog

ActionParty: Multi-Subject Action Binding in Generative Video Games

📅 2026-04-02 cs.CVcs.AIcs.LG

Alexander Pondaven, Ziyi Wu, Igor Gilitschenski, Philip Torr, Sergey Tulyakov (+2 more)

Recent advances in video diffusion have enabled the development of "world models" capable of simulating interactive environments. However, these models are largely restricted to single-agent...

📄 View PDF 🔗 arXiv

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

📅 2026-04-02 cs.CLcs.AIcs.LG

Daiwei Chen, Zhoutong Fu, Chengming Jiang, Haichao Zhang, Ran Zhou (+10 more)

Language models (LMs) are increasingly extended with new learnable vocabulary tokens for domain-specific tasks, such as Semantic-ID tokens in generative recommendation. The standard practice...

📄 View PDF 🔗 arXiv

Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning

📅 2026-04-02 cs.LGcs.AIcs.CL

Bangji Yang, Hongbo Ma, Jiajun Fan, Ge Liu

Large Language Models employing Chain-of-Thought reasoning achieve strong performance but suffer from excessive token consumption that inflates inference costs. Existing efficiency methods such as...

📄 View PDF 🔗 arXiv

Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining

📅 2026-04-02 cs.CVcs.GR

Junxuan Li, Rawal Khirodkar, Chengan He, Zhongshi Jiang, Giljoo Nam (+35 more)

High-quality 3D avatar modeling faces a critical trade-off between fidelity and generalization. On the one hand, multi-view studio data enables high-fidelity modeling of humans with precise control...

📄 View PDF 🔗 arXiv

Topological Effects in Neural Network Field Theory

📅 2026-04-02 hep-thcs.LG

Christian Ferko, James Halverson, Vishnu Jejjala, Brandon Robinson

Neural network field theory formulates field theory as a statistical ensemble of fields defined by a network architecture and a density on its parameters. We extend the construction to topological...

📄 View PDF 🔗 arXiv

go-$m$HC: Direct Parameterization of Manifold-Constrained Hyper-Connections via Generalized Orthostochastic Matrices

📅 2026-04-02 cs.LGcs.CL

Torque Dandachi, Sophia Diggs-Galligan

Doubly stochastic matrices enable learned mixing across residual streams, but parameterizing the set of doubly stochastic matrices (the Birkhoff polytope) exactly and efficiently remains an open...

📄 View PDF 🔗 arXiv

PARD-SSM: Probabilistic Cyber-Attack Regime Detection via Variational Switching State-Space Models

📅 2026-04-02 cs.CR

Prakul Sunil Hiremath, PeerAhammad M Bagawan, Sahil Bhekane

Modern adversarial campaigns unfold as sequences of behavioural phases - Reconnaissance, Lateral Movement, Intrusion, and Exfiltration - each often indistinguishable from legitimate traffic when...

📄 View PDF 🔗 arXiv

Taming the Exponential: A Fast Softmax Surrogate for Integer-Native Edge Inference

📅 2026-04-02 cs.LGcs.AR

Dimitrios Danopoulos, Enrico Lupi, Michael Kagan, Maurizio Pierini

Softmax can become a computational bottleneck in the Transformer model's Multi-Head Attention (MHA) block, particularly in small models under low-precision inference, where exponentiation and...

📄 View PDF 🔗 arXiv

Omni123: Exploring 3D Native Foundation Models with Limited 3D Data by Unifying Text to 2D and 3D Generation

📅 2026-04-02 cs.CVcs.AI

Chongjie Ye, Cheng Cao, Chuanyu Pan, Yiming Hao, Yihao Zhi (+2 more)

Recent multimodal large language models have achieved strong performance in unified text and image understanding and generation, yet extending such native capability to 3D remains challenging due to...

📄 View PDF 🔗 arXiv

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

📅 2026-04-02 cs.LGcs.AI

Gengsheng Li, Tianyu Yang, Junfeng Fang, Mingyang Song, Mao Zheng (+4 more)

Reinforcement learning with verifiable rewards (RLVR) has become a standard paradigm for post-training large language models. While Group Relative Policy Optimization (GRPO) is widely adopted, its...

📄 View PDF 🔗 arXiv

AlloyVAE: A generative model for complex probabilistic field-to-field relationships in alloys

📅 2026-04-02 cond-mat.mtrl-sci

Ningyu Yan, Zhuocheng Xie, Kai Guo, Yejun Gu, Huajian Gao (+1 more)

The inherent compositional heterogeneity of multi-principal element alloys (MPEAs) gives rise to complex, spatially varying mechanical fields that cannot be uniquely determined from coarse-grained...

📄 View PDF 🔗 arXiv

De Jure: Iterative LLM Self-Refinement for Structured Extraction of Regulatory Rules

📅 2026-04-02 cs.AIcs.CLcs.LG

Keerat Guliani, Deepkamal Gill, David Landsman, Nima Eshraghi, Krishna Kumar (+1 more)

Regulatory documents encode legally binding obligations that LLM-based systems must respect. Yet converting dense, hierarchically structured legal text into machine-readable rules remains a costly,...

📄 View PDF 🔗 arXiv

Selective State-Space Models for Koopman-based Data-driven Distribution System State Estimation

📅 2026-04-02 eess.SY

Bader Alabdulrazzaq, Bri-Mathias Hodge

Distribution System State Estimation (DSSE) plays an increasingly-important role in modern power grids due to the integration of distributed energy resources (DERs). The inherent characteristics of...

📄 View PDF 🔗 arXiv

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

📅 2026-04-02 cs.LGcs.AI

Tin Hadži Veljković, Joshua Rosenthal, Ivor Lončarić, Jan-Willem van de Meent

Generative models for crystalline materials often rely on equivariant graph neural networks, which capture geometric structure well but are costly to train and slow to sample. We present Crystalite,...

📄 View PDF 🔗 arXiv

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

📅 2026-04-02 cs.LG

Zhengxi Lu, Zhiyuan Yao, Jinyang Wu, Chengcheng Han, Qi Gu (+5 more)

Agent skills, structured packages of procedural knowledge and executable resources that agents dynamically load at inference time, have become a reliable mechanism for augmenting LLM agents. Yet...

📄 View PDF 🔗 arXiv

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

📅 2026-04-02 cs.LGcs.RO

Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause (+1 more)

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study...

📄 View PDF 🔗 arXiv

Best-Arm Identification with Noisy Actuation

📅 2026-04-02 cs.ITcs.LG

Merve Karakas, Osama Hanna, Lin F. Yang, Christina Fragouli

In this paper, we consider a multi-armed bandit (MAB) instance and study how to identify the best arm when arm commands are conveyed from a central learner to a distributed agent over a discrete...

📄 View PDF 🔗 arXiv

Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives

📅 2026-04-02 cs.LGstat.ML

Hao Zhu, Di Zhou, Donna Slonim

Understanding causal dependencies in observational data is critical for informing decision-making. These relationships are often modeled as Bayesian Networks (BNs) and Directed Acyclic Graphs (DAGs)....

📄 View PDF 🔗 arXiv

BVFLMSP : Bayesian Vertical Federated Learning for Multimodal Survival with Privacy

📅 2026-04-02 stat.MLcs.LG

Abhilash Kar, Basisth Saha, Tanmay Sen, Biswabrata Pradhan

Multimodal time-to-event prediction often requires integrating sensitive data distributed across multiple parties, making centralized model training impractical due to privacy constraints. At the...

📄 View PDF 🔗 arXiv

(PAC-)Learning state machines from data streams: A generic strategy and an improved heuristic (Extended version)

📅 2026-04-02 cs.FLcs.LG

Robert Baumgartner, Sicco Verwer

This is an extended version of our publication Learning state machines from data streams: A generic strategy and an improved heuristic, International Conference on Grammatical Inference (ICGI) 2023,...

📄 View PDF 🔗 arXiv

📚 arXiv Paper Feed