Skip to content

每月论文更新 - 2026年02月02日 #31

@github-actions

Description

@github-actions

最后更新:2026-02-02 00:05

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(202篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents 2026-01-29
2 Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes 2026-01-26
3 Efficient Self-Learning and Model Versioning for AI-native O-RAN Edge 2026-01-24
4 Miner:Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models 2026-01-08
5 Evaluating Parameter Efficient Methods for RLVR 2025-12-29
6 $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models 2025-10-29
7 RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation 2025-09-19
8 Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation 2025-02-14

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Online Risk-Averse Planning in POMDPs Using Iterated CVaR Value Function 2026-01-28
2 Structural Monotonicity in Transmission Scheduling for Remote State Estimation with Hidden Channel Mode 2026-01-27
3 Toward Learning POMDPs Beyond Full-Rank Actions and State Observability 2026-01-26
4 Neural Value Iteration 2025-11-11
5 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey 2025-09-02
6 Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents 2025-05-16
7 Near-Optimal Partially Observable Reinforcement Learning with Partial Online State Information 2023-06-14

3. sparse reward reinforcement learning

序号 标题 日期
1 What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning? 2025-09-04
2 LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning 2025-08-25
3 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
4 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
5 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
6 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
7 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
8 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 ECSEL: Explainable Classification via Signomial Equation Learning 2026-01-29
2 Modeling Endogenous Logic: Causal Neuro-Symbolic Reasoning Model for Explainable Multi-Behavior Recommendation 2026-01-29
3 Causal Discovery for Explainable AI: A Dual-Encoding Approach 2026-01-29
4 AC2L-GAD: Active Counterfactual Contrastive Learning for Graph Anomaly Detection 2026-01-29
5 Causal Inference in Biomedical Imaging via Functional Linear Structural Equation Models 2026-01-28
6 Exact Graph Learning via Integer Programming 2026-01-28
7 MuVaC: AVariational Causal Framework for Multimodal Sarcasm Understanding in Dialogues 2026-01-28
8 A generative machine learning model for designing metal hydrides applied to hydrogen storage 2026-01-28
9 Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control 2026-01-27
10 Decoupling and randomization for double-indexed permutation statistics 2026-01-27
11 GraIP: A Benchmarking Framework For Neural Graph Inverse Problems 2026-01-26
12 Smooth, Sparse, and Stable: Finite-Time Exact Skeleton Recovery via Smoothed Proximal Gradients 2026-01-26
13 LungCRCT: Causal Representation based Lung CT Processing for Lung Cancer Treatment 2026-01-26
14 Ordering-based Causal Discovery via Generalized Score Matching 2026-01-22
15 From Generative Engines to Actionable Simulators: The Imperative of Physical Grounding in World Models 2026-01-21
16 Causal Regime Detection in Energy Markets With Augmented Time Series Structural Causal Models 2025-11-06
17 Causal Graph Neural Networks for Healthcare 2025-11-04
18 DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning 2025-09-25
19 Networks of Causal Abstractions: A Sheaf-theoretic Framework 2025-09-25
20 Causality-guided Prompt Learning for Vision-language Models via Visual Granulation 2025-09-04
21 Differentiable Cyclic Causal Discovery Under Unmeasured Confounders 2025-08-11
22 A Bayesian approach to the survivor average causal effect in cluster-randomized crossover trials 2025-05-27
23 Causal Meta-Analysis: Rethinking the Foundations of Evidence-Based Medicine 2025-05-26
24 PROPHET: An Inferable Future Forecasting Benchmark with Causal Intervened Likelihood Estimation 2025-04-02

6. video super resolution

序号 标题 日期
1 OSDEnhancer: Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion 2026-01-28
2 S3-CLIP: Video Super Resolution for Person-ReID 2026-01-13
3 HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution 2026-01-08
4 Seeing the Unseen: Zooming in the Dark with Event Cameras 2026-01-05
5 Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion 2025-12-29
6 FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring 2025-12-04
7 HunyuanVideo 1.5 Technical Report 2025-11-24
8 STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution 2025-11-24

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts 2026-01-29
2 SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control 2026-01-29
3 Bridging Graph Structure and Knowledge-Guided Editing for Interpretable Temporal Knowledge Graph Reasoning 2026-01-29
4 OVD: On-policy Verbal Distillation 2026-01-29
5 Visual Disentangled Diffusion Autoencoders: Scalable Counterfactual Generation for Foundation Models 2026-01-29
6 The Double-Edged Sword of Knowledge Transfer: Diagnosing and Curing Fairness Pathologies in Cross-Domain Recommendation 2026-01-29
7 CE-GOCD: Central Entity-Guided Graph Optimization for Community Detection to Augment LLM Scientific Question Answering 2026-01-29
8 Computational investigation of single herbal drugs in Ayurveda for diabetes and obesity using knowledge graph and network pharmacology 2026-01-29
9 PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Policy Optimization 2026-01-29
10 Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance 2026-01-29
11 Semantic-Guided Dynamic Sparsification for Pre-Trained Model-based Class-Incremental Learning 2026-01-29
12 Dynamical Adapter Fusion: Constructing A Global Adapter for Pre-Trained Model-based Class-Incremental Learning 2026-01-29
13 Grounding and Enhancing Informativeness and Utility in Dataset Distillation 2026-01-29
14 From Scattered to Structured: A Vision for Automating Architectural Knowledge Management 2026-01-27
15 DREAMSTATE: Diffusing States and Parameters for Recurrent Large Language Models 2026-01-27
16 Cutting the Gordian Knot: Detecting Malicious PyPI Packages via a Knowledge-Mining Framework 2026-01-23
17 Graph-Anchored Knowledge Indexing for Retrieval-Augmented Generation 2026-01-23
18 LaVR: Scene Latent Conditioned Generative Video Trajectory Re-Rendering using Large 4D Reconstruction Models 2026-01-21
19 Human Simulation Computation: A Human-Inspired Framework for Adaptive AI Systems 2026-01-20
20 Enginuity: Building an Open Multi-Domain Dataset of Complex Engineering Diagrams 2026-01-19
21 UniHash: Unifying Pointwise and Pairwise Hashing Paradigms 2026-01-14
22 Knowledge-Embedded and Hypernetwork-Guided Few-Shot Substation Meter Defect Image Generation Method 2026-01-14
23 EfficientFSL: Enhancing Few-Shot Classification via Query-Only Tuning in Vision Transformers 2026-01-13
24 Computing Supported Models via Transformation to Stable Models 2025-12-05
25 Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models 2025-11-29
26 Two Heads are Better than One: Distilling Large Language Model Features Into Small Models with Feature Decomposition and Mixture 2025-11-10
27 The Imperfect Learner: Incorporating Developmental Trajectories in Memory-based Student Simulation 2025-11-08
28 VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree 2025-10-26
29 RestoRect: Degraded Image Restoration via Latent Rectified Flow & Feature Distillation 2025-09-27
30 SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation 2025-09-19
31 Arce: Augmented Roberta with Contextualized Elucidations for Ner in Automated Rule Checking 2025-08-10
32 X-SAM: From Segment Anything to Any Segmentation 2025-08-06
33 GraphRAG-R1: Graph Retrieval-Augmented Generation with Process-Constrained Reinforcement Learning 2025-07-31
34 Bridging Weakly-Supervised Learning and VLM Distillation: Noisy Partial Label Learning for Efficient Downstream Adaptation 2025-06-03
35 From Limited Labels to Open Domains:An Efficient Learning Method for Drone-view Geo-Localization 2025-03-10
36 RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation 2025-02-16
37 A Concept-Centric Approach to Multi-Modality Learning 2024-12-18
38 One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs 2024-09-21

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Additive sink subtraction 2026-01-26
2 Variants of Wythoff game with terminal positions or blocking maneuvers 2025-12-12
3 Impartial Games with Activeness 2025-11-26
4 $\mathcal{L}\mathcal{R}$-Ending partisan rulesets 2025-11-18
5 Various Diamond Properties in Combinatorial Game Theory 2025-09-26
6 Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning 2025-07-16
7 On 3-terminal positions in Hex 2025-07-11
8 A number game reconciliation 2025-07-07
9 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
10 Computational and Algebraic Structure of Board Games 2025-02-18
11 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
12 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
13 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
14 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
15 Shogi and Frieze group 2023-11-15
16 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
17 Niel's Chess -- Rules for Xiangqi 2023-06-27
18 On the complexity of Dark Chinese Chess 2021-12-06
19 Cumulative Games: Who is the current player? 2020-05-13

9. code llm

序号 标题 日期
1 Multi-task Code LLMs: Data Mix or Model Merge? 2026-01-28
2 Usage, Effects and Requirements for AI Coding Assistants in the Enterprise: An Empirical Study 2026-01-27
3 AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion 2026-01-27
4 Evaluating and Achieving Controllable Code Completion in Code LLM 2026-01-22
5 Can LLMs Compress (and Decompress)? Evaluating Code Understanding and Execution via Invertibility 2026-01-19
6 Model See, Model Do? Exposure-Aware Evaluation of Bug-vs-Fix Preference in Code LLMs 2026-01-15
7 X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests 2026-01-11
8 Beyond Functional Correctness: Exploring Hallucinations in LLM-Generated Code 2024-04-01

10. speech recognition

序号 标题 日期
1 Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER 2026-01-29
2 Qwen3-ASR Technical Report 2026-01-29
3 asr_eval: Algorithms and tools for multi-reference and streaming speech recognition evaluation 2026-01-28
4 Text-only adaptation in LLM-based ASR through text denoising 2026-01-28
5 Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection 2026-01-28
6 A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models 2026-01-28
7 Unheard in the Digital Age: Rethinking AI Bias and Speech Diversity 2026-01-26
8 CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition 2025-02-03

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 QuaMo: Quaternion Motions for Vision-based 3D Human Kinematics Capture 2026-01-27
2 On the Role of Depth in Surgical Vision Foundation Models: An Empirical Study of RGB-D Pre-training 2026-01-26
3 ME-WARD: A multimodal ergonomic analysis tool for musculoskeletal risk assessment from inertial and video data in working plac 2026-01-24
4 GPA-VGGT:Adapting VGGT to Large Scale Localization by Self-Supervised Learning with Geometry and Physics Aware Loss 2026-01-23
5 DeltaDorsal: Enhancing Hand Pose Estimation with Dorsal Features in Egocentric Views 2026-01-21
6 Diffusion-based Inverse Model of a Distributed Tactile Sensor for Object Pose Estimation 2026-01-19
7 Dual-quaternion learning control for autonomous vehicle trajectory tracking with safety guarantees 2026-01-06
8 UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots 2025-12-30
9 KV-Tracker: Real-Time Pose Tracking with Transformers 2025-12-27
10 ParaMaP: Parallel Mapping and Collision-free Motion Planning for Reactive Robot Manipulation 2025-12-27
11 Tracking by Predicting 3-D Gaussians Over Time 2025-12-27
12 Optical Flow-Guided 6DoF Object Pose Tracking with an Event Camera 2025-12-24
13 Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence 2025-12-04
14 RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis 2025-11-21
15 PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting 2025-10-21
16 When Tracking Fails: Analyzing Failure Modes of SAM2 for Point-Based Tracking in Surgical Videos 2025-10-02
17 FERA: A Pose-Based Semantic Pipeline for Automated Foil Fencing Refereeing 2025-09-23
18 Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization 2025-09-15
19 PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation 2025-08-04
20 From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation 2025-07-17
21 Matching Anything by Segmenting Anything 2024-06-06
22 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
23 Fusion of Visual-Inertial Odometry with LiDAR Relative Localization for Cooperative Guidance of a Micro-Scale Aerial Vehicle 2023-06-30
24 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
25 Unifying Tracking and Image-Video Object Detection 2022-11-20
26 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
27 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 RoamScene3D: Immersive Text-to-3D Scene Generation via Adaptive Object-aware Roaming 2026-01-27
2 TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment 2026-01-27
3 Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting 2026-01-26
4 Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored Representations 2026-01-19
5 KaoLRM: Repurposing Pre-trained Large Reconstruction Models for Parametric 3D Face Reconstruction 2026-01-19
6 Proc3D: Procedural 3D Generation and Parametric Editing of 3D Shapes with Large Language Models 2026-01-18
7 PartMotionEdit: Fine-Grained Text-Driven 3D Human Motion Editing via Part-Level Modulation 2025-12-30
8 Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection 2025-12-14
9 Geodiffussr: Generative Terrain Texturing with Elevation Fidelity 2025-11-28
10 Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding 2025-10-10
11 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion 2025-08-15
12 LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks 2025-07-27
13 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
14 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
15 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
16 MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection 2025-04-03
17 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data 2025-03-27
18 A Text-to-3D Framework for Joint Generation of CG-Ready Humans and Compatible Garments 2025-03-15
19 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Part-based Procedural Models 2025-01-28
20 PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion 2024-12-06
21 GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data 2024-11-27
22 TexPro: Text-guided PBR Texturing with Procedural Material Modeling 2024-10-21
23 Reconstructing Hand-Held Objects in 3D from Images and Videos 2024-04-09
24 Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video 2024-01-16

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 SecIC3: Customizing IC3 for Hardware Security Verification 2026-01-29
2 Position: Certifiable State Integrity in Cyber-Physical Systems -- Why Modular Sovereignty Solves the Plasticity-Stability Paradox 2026-01-29
3 VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning 2026-01-27
4 How to Verify a Turing Machine with Dafny 2026-01-21
5 Quantum automated theorem proving 2026-01-12
6 GDEPO: Group Dual-dynamic and Equal-right Advantage Policy Optimization with Enhanced Training Data Utilization for Sample-Constrained Reinforcement Learning 2026-01-11
7 Automated Theorem Proving for Prolog Verification 2026-01-07
8 Beyond Correctness: Exposing LLM-generated Logical Flaws in Reasoning via Multi-step Automated Theorem Proving 2025-12-29
9 Symbolic Specification and Reasoning for Quantum Data and Operations 2025-12-26
10 MSC-180: A Benchmark for Automated Formal Theorem Proving from Mathematical Subject Classification 2025-12-20
11 Advancing Mathematical Research via Human-AI Interactive Theorem Proving 2025-12-10
12 DUET: Agentic Design Understanding via Experimentation and Testing 2025-12-06
13 Formal Verification of Noisy Quantum Reinforcement Learning Policies 2025-12-01
14 Proof Strategy Extraction from LLMs for Enhancing Symbolic Provers 2025-10-11
15 L-Mosaics and Bounded Join-Semilattices in Isabelle/HOL 2025-09-24
16 An ACL2s Interface to Z3 2025-07-25
17 Interpolation with Automated First-Order Reasoning 2025-07-02
18 Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance 2025-07-02
19 Software is infrastructure: failures, successes, costs, and the case for formal verification 2025-06-15
20 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
21 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions