1. A Lightweight Multi-Agent Framework for Automated Concrete Barrier Design
Wanting Wang, Xiye Ma, Yuyang He, Minghui Cheng, Ran Cao
2. IAPO: Input Attribution-Aware Policy Optimization for Tool Use in Small Multimodal Agents
Yifan Yang, Zhen Zhang, Jiayi Tian, Liyan Tan, Zheng Zhang
3. MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning
Shang Ma, Jisheng Dang, Wencan Zhang, Yifan Zhang, Bimei Wang, Hong Peng, Bin Hu, Qi Tian, Tat-Seng Chua
6. MoCA-Agent: A Market-of-Claims Code Agent for Financial and Numerical Reasoning
Abdelrahman Abdallah, AbdelRahim A. Elmadany, Sameh Al Natour, Hasan Cavusoglu, Adam Jatowt, Muhammad Abdul-Mageed
7. A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents
8. Counterexample Guided Learning in the Large using Reasoning Agents
Hongyi Liu, Frederic Sala, Thomas Reps, Adithya Murali
11. MedCTA: A Benchmark for Clinical Tool Agents
Tajamul Ashraf, Hyewon Jeong, Fida Mohammad Thoker, Bernard Ghanem
12. InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning
Ziang Yan, Sheng Xia, Jiashuo Yu, Yue Wu, Tianxiang Jiang, ..., Yinan He, Kai Chen, Limin Wang, Yu Qiao, Yi Wang
13. FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse
Lingzhi Yuan, Chenghao Deng, Fangxu Yu, Souradip Chakraborty, Mohammad Rostami, Furong Huang
14. APPO: Agentic Procedural Policy Optimization
Xucong Wang, Ziyu Ma, Yong Wang, Yuxiang Ji, Shidong Yang, Guanhua Chen, Pengkun Wang, Xiangxiang Chu
15. Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment
Derek Yohn, Luke Flancher, Mirajul Islam, Khaled Slhoub
16. The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes
Avinash Anand, Mahisha Ramesh, Avni Mittal, Ashutosh Kumar, Erik Cambria, Zhengkui Wang, Timothy Liu, Aik Beng Ng, Simon See, Rajiv Ratn Shah
17. WorldReasoner: Evaluating Whether Language Model Agents Forecast Events with Valid Reasoning
Yizhou Chi, Eric Chamoun, Zifeng Ding, Andreas Vlachos
18. OCELOT: Inference-Leakage Budgets for Privacy-Preserving LLM Agents
19. Automated Creativity Evaluation of Language Models Across Open-Ended Tasks
Min Sen Tan, Zachary Kit Chun Choy, Syed Ali Redha Alsagoff, Nadya Yuki Wangsajaya, Mohor Banerjee, Swaagat Bikash Saikia, Alvin Chan
20. Calibration Drift Under Reasoning: How Chain-of-Thought Budgets Induce Overconfidence in Large Language Models
Prakul Sunil Hiremath, Harshit R. Hiremath
21. Measuring Epistemic Resilience of LLMs Under Misleading Medical Context
Hongjian Zhou, Xinyu Zou, Jinge Wu, Sean Wu, Junchi Yu, ..., Mingde Zeng, Lei Clifton, Linda Shapiro, Fenglin Liu, David A. Clifton
22. SG2Loc: Sequential Visual Localization on 3D Scene Graphs
Nicole Damblon, Olga Vysotska, Federico Tombari, Marc Pollefeys, Daniel Barath
23. From Content to Knowledge: Lightning Fast Long-Video Understanding with Neural Knowledge Representations
Yuchen Guan, Xiao Li, Zongyu Guo, Xiaoyi Zhang, Xiulian Peng, Chun Yuan, Yan Lu
24. INFRAMIND: Infrastructure-Aware Multi-Agent Orchestration
Ahasan Kabir, Jiaqi Xue, Mengxin Zheng, Qian Lou
25. Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents
Hao-Lun Hsu, Nikki Lijing Kuang, Boyi Liu, Zhewei Yao, Yuxiong He
27. Multi-Agent Reasoning with Adaptive Worker Allocation for Stance Detection
Meysam Sabbaghan, Arman Zareian Jahromi, Doina Caragea
28. Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure
29. Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs
Sanjay Adhikesaven, Haoxiang Sun, Sewon Min
31. HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation
Haoran Liu, Yuwei Zhang, Xiyao Li, Bohan Lyu, Jingbo Shang
32. Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning
Kai Liu, Peijie Dong, Xinchen Xie, Jianfei Gao, Qipeng Guo, Xiaowen Chu, Shaoting Zhang, Kai Chen
33. Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling
Yucheng Li, Huiqiang Jiang, Yang Xu, Jianxin Yang, Yi Zhang, ..., Bo Zheng, Fei Huang, Junyang Lin, Dayiheng Liu, Jingren Zhou
34. Agent Skill Evaluation and Evolution: Frameworks and Benchmarks
Kexin Ding, Yang Zhou, Can Jin, Feng Tong, Mu Zhou, Dimitris N. Metaxas
37. Goal-Autopilot: A Verifiable Anti-Fabrication Firewall for Unattended Long-Horizon Agents
38. MPC-Patch-Bench: Security-Aware LLM Code Patch for Multi-Party Computation
Yukuan Zhang, Mengxin Zheng, Qian Lou
39. Runtime Skill Audit: Targeted Runtime Probing for Agent Skill Security
40. SwarmSense-DNN: A Trustworthy and Decentralized Neural Framework for Proactive Anomaly Defense in Consumer IoT
Jing Yang, Vijay Govindarajan, Saad Arif, Xu Xu, Mohamed Kallel, Zaffar Ahmed Shaikh, Zhe Liu, Chunhong Yuan, Lip Yee Por
41. CORE-Bench: A Comprehensive Benchmark for Code Retrieval in the Era of Agentic Coding
Fuwei Zhang, Yanzhao Zhang, Mingxin Li, Dingkun Long, Lexiang Hu, Pengjun Xie, Zhao Zhang, Fuzhen Zhuang
42. MASK: Multi-Agent Semantic K-Scheduling for Risk-Sensitive 6G Robotics
Ahmet Gunhan Aydin, Elif Tugce Ceran
43. Intelligent Automation for Embodied Benchmark Construction: Pipelines, Embodiments, Simulators, and Trends
Jinshan Lai, Jianwei Hu, Baoyang Jiang, Fengchun Zhang, Leyuan Wang, Haotian Li, Yida Wang, Tingxuan Huang, Xi Ren, Qiang Ma
44. UniIntervene: Agentic Intervention for Efficient Real-World Reinforcement Learning
Haoyuan Deng, Yitong Gao, Yudong Lin, Haichao Liu, Zhenyu Wu, Ziwei Wang
45. TimeRouter: Efficient and Adaptive Routing of Time-Series Foundation Models
Kanghui Ning, Yushan Jiang, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Dongjin Song
46. Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
Michal Chudoba, Sergey Alyaev, Petra Galuscakova, Tomasz Wiktorski
47. Bootstrapped Monitoring: Leveraging Transparent Reasoning to Oversee Stronger AI Agents
49. ParseFixer: An Agentic Framework for Document Parsing via Selective Multimodal Correction
LeKai Yu, Hao Liu, Kun Wang, Zhiran Li, Ruping Cao, Fan Liu, Yupeng Hu
50. An Entropy-based Framework for Hybrid Coalitions in Game Theory. Part I: Human Arbitration
Salome A. Sepulveda-Fontaine, Jose M. Amigo
51. AVIS: Adaptive Test-Time Scaling for Vision-Language Models
Ahmadreza Jeddi, Minh Ngoc Le, Amirhossein Kazerouni, Hakki Can Karaimer, Hue Nguyen, ..., Michael Brudno, Alex Levinshtein, Konstantinos G. Derpanis, Babak Taati, Radek Grzeszczuk
52. VICX: Generalizable Robot Manipulation via Video Generation and In-Context Operator Network
Song Chen, Linyan Xiang, Ying Zhou, Liu Yang
53. Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization
55. Decoding Multimodal Cues: Unveiling the Implicit Meaning Behind Hateful Videos
Junyu Lu, Deyi Ji, Liqun Liu, Xiaokun Zhang, Youlin Wu, ..., Huan Yu, Jie Jiang, Bo Xu, Liang Yang, Hongfei Lin
56. DeceptionX: Explainable Deception Detection with Multimodal Large Language Models
Jiayu Zhang, Shuo Ye, Jiajian Huang, Yawen Cui, Taorui Wang, Wei Xia, Zeheng Wang, Haowen Tang, Hui Ma, Zitong Yu
57. UniReason-Med: A Shared Grounded Reasoning Interface for 2D-to-3D Transfer in Medical VQA
Mengzhuo Chen, Yan Shu, Chi Liu, Hongming Piao, Xidong Wang, Derek Li, Bryan Dai
58. Benchmarking Large Language Models for Safety Data Extraction
Jonas Grill, Thomas Bayer, Sören Berlinger
59. Distortion-Resilient Robotic Imitation Learning for Autonomous Cable Routing
Hao Wang, Fu-Zhao Ou, Shiqi Wang, Zhaolin Wan, Xiaopeng Fan