研究成果

UBench: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

2025-06-04

Adversarial Preference Learning for Robust LLM Alignment

2025-05-30

GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning

2025-05-28

Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models

2025-05-27

HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval Augmented Generation

2025-05-26

QAEncoder: Towards Aligned Representation Learning in Question Answering Systems

2025-05-26

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

2025-05-26

XFINDER: LARGE LANGUAGE MODELS AS AUTOMATED EVALUATORS FOR RELIABLE EVALUATION

2025-02-25

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

2025-02-23

FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

2024-10-04

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

2024-07-15

Memory3 : Language Modeling with Explicit Memory

2024-07-01

NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

2024-06-04

Grimoire is All You Need for Enhancing Large Language Models

2024-01-10

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

2023-12-29

Time-interval Aware Share Recommendation via Bi-directional Continuous Time Dynamic Graphs

2023-07-18