| # | Paper | Area |
| --- | --- | --- |
| 1 | Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay<br>Yifan Sun, Huan Zhang | LLM |
| 2 | AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time (workshop paper)<br>Junyu Zhang, Han Wang, Huan Zhang | LLM |
| 3 | RAST: Reasoning Activation in LLMs via Small-model Transfer<br>Siru Ouyang, Xinyu Zhu, Zilin Xiao, Minhao Jiang, Yu Meng, Jiawei Han | LLM |
| 4 | When Reasoning Meets Its Laws (workshop paper)<br>Junyu Zhang, Yifan Sun, Huan Zhang | LLM |
| 5 | FGBench: A Dataset and Benchmark for Molecular Property Reasoning at Functional Group-Level in Large Language Models<br>Xuan Liu, Siru Ouyang, Xianrui Zhong, Jiawei Han, Huimin Zhao | LLM |
| 6 | Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness<br>Peizhi Niu, Olgica Milenkovic | LLM |
| 7 | AgMMU: A Comprehensive Agricultural Multimodal Understanding and Reasoning Benchmark<br>Ziqi Pang, Yunze Man, Yuxiong Wang | LLM |
| 8 | LLM Strategic Reasoning: Agentic Study through Behavioral Game Theory<br>Jingru Jia, Zehua Yuan, Junhao Pan, Paul McNamara, Deming Chen | LLM |
| 9 | MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations<br>Vardhan Dongre, Chi Gui, Shubham Garg, Hooshang Nayyeri, Gokhan Tur, Dilek Hakkani-Tür, Vikram S. Adve | LLM |
| 10 | Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage<br>Ziqi Yuan, Haoyang Zhang, Yirui Eric Zhou, Jian Huang | LLM, Systems |
| 11 | Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning<br>Haozhen Zhang, Tao Feng, Jiaxuan You | LLM |
| 12 | Targeted Redirecting of Agentic Preferences<br>Jehyeok Yeon, Hangoo Kang, Gagandeep Singh | LLM |
| 13 | The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning<br>Shivam Agarwal, Zimin Zhang, Lifan Yuan, Jiawei Han, Hao Peng | LLM |
| 14 | DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation<br>Jiashuo Sun, Xianrui Zhong, Sizhe Zhou, Jiawei Han | LLM |
| 15 | Abstract Rendering: Computing All that is Seen in Gaussian Splat Scenes<br>Chenxi Ji, Yangge Li, Xiangru Zhong, Huan Zhang, Sayan Mitra | Vision |
| 16 | Can NeRFs “See” without Cameras?<br>Chaitanya Amballa, Sattwik Basu, Yu-lin Wei, Romit Roy Choudhury | Vision |
| 17 | Visual Sync: Multi‑Camera Synchronization via Cross‑View Object Motion<br>Shaowei Liu, David Yao, Saurabh Gupta, Shenlong Wang | Vision |
| 18 | CAR: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching<br>Chen Chen, Pengsheng Guo, Liangchen Song, Jiasen Lu, Rui Qian, Tsu-Jui Fu, Xinze Wang, Wei Liu, Yinfei Yang, Alex Schwing | Vision |
| 19 | NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses<br>Jing Wen, Alex Schwing, Shenlong Wang | Vision |
| 20 | Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360° Firefighting Video<br>Aditi Tiwari, Farzaneh Masoud, Dac Trong Nguyen, Jill Kraft, Heng Ji, Klara Nahrstedt | Vision |
| 21 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders<br>Savya Khosla, Sethuraman T V, Barnett Lee, Alex Schwing, Derek Hoiem | Vision |
| 22 | On Inductive Biases That Enable Generalization in Diffusion Transformers<br>Jia An, De Wang, Pengsheng Guo, Jiebo Luo, Alex Schwing | Vision |
| 23 | HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video<br>Hongchi Xia, Chih-Hao Lin, Hao-Yu Hsu, Quentin Leboutet, Katelyn Gao, Michael Paulitsch, Benjamin Ummenhofer, Shenlong Wang | Vision |
| 24 | MR. Video: MapReduce as an Effective Principle for Long Video Understanding<br>Ziqi Pang, Yuxiong Wang | Vision |
| 25 | RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes<br>Fang Li, Hao Zhang, Narendra Ahuja | Vision |
| 26 | Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image<br>Junkun Chen, Yuxiong Wang | Vision |
| 27 | One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding<br>Zheyu Zhang, Ziqi Pang, Yuxiong Wang | Vision |
| 28 | DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images<br>Ozgur Kara, Harris Nisar, James M. Rehg | Vision, Generative |
| 29 | DMol: A Schedule-Driven Diffusion Model for Highly Efficient and Versatile Molecule Generation<br>Peizhi Niu, Shane Wang, Olgica Milenkovic | Generative |
| 30 | Model Context Protocol for Vision Agents: Schema, Memory, and World Model Implications<br>Aditi Tiwari, Akshit Bhalla, Darshan Prasad | Vision, AI Agents |
| 31 | GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents<br>Rui Yang, Huan Zhang | AI Agents |
| 32 | Self-Guided Hierarchical Exploration for Generalist Foundation Model Web Agents<br>Qianlan Yang, Yuxiong Wang | AI Agents |
| 33 | Two‑Stage Learning of Stabilizing Neural Controllers via Zubov Sampling and Iterative Domain Expansion<br>Xiangru Zhong, Haoyu Li, Bin Hu, Huan Zhang | ML |
| 34 | Efficient Utility-Preserving Machine Unlearning<br>Tianbai Yu | ML |
| 35 | Hybrid Latent Reasoning via Reinforcement Learning<br>Zhenrui Yue, Bowen Jin, Huimin Zeng, Honglei Zhuang, Zhen Qin, Jinsung Yoon, Lanyu Shang, Jiawei Han, Dong Wang | RL |
| 36 | Sotopia-RL: Reward Design for Social Intelligence (workshop paper)<br>Haofei Yu, Jiaxuan You | RL |
| 37 | Scalable Policy-Based RL Algorithms for POMDPs<br>Amey Anjarlekar, Rasoul Etesami, R. Srikant | Theory, RL |
| 38 | Detection Is All You Need: A Feasible Optimal Prior-Free Black-Box Approach For Piecewise Stationary Bandits<br>Argyrios Gerogiannis, Yu-Han Huang, Subhonmesh Bose, Venugopal V Veeravalli | Theory |
| 39 | Riemannian Consistency Model<br>Chaoran Cheng, Yusong Wang, Yuxin Chen, Xiangxin Zhou, Nanning Zheng, Ge Liu | Theory |
| 40 | Sketched Adaptive Distributed Deep Learning: A Sharp Convergence Analysis<br>Zhijie Chen, Qiaobo Li, Arindam Banerjee | Theory |
| 41 | Distributionally Robust Performative Optimization<br>Zhuangzhuang Jia, Roy Dong, Grani Hanasusanto | Theory |
| 42 | Sketched Gaussian Mechanism for Private Federated Learning<br>Qiaobo Li, Zhijie Chen, Arindam Banerjee | Theory |
| 43 | Generative Caching for Structurally Similar Prompts and Responses<br>Sarthak Chakraborty, Suman Nath, Xuchao Zhang, Chetan Bansal, Indranil Gupta | Infrastructure |
| 44 | Learning Reconfigurable Representations for Multimodal Federated Learning with Missing Data<br>Duong Nguyen | Federated Learning |
| 45 | Clip-and-Verify: Linear Constraint-Driven Domain Clipping for Accelerating Neural Network Verification<br>Duo Zhou, Grani Hanasusanto, Huan Zhang | NN Verification |