People
Publications
Resources
Contact
Paper-Conference
Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs
Length generalization, the ability to solve problems longer than those seen during training, remains a critical challenge for large …
Y. Hu
,
S. Kang
,
H. Yang
,
H. Xu
,
M. Zhang
PDF
LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?
Z. He
,
Y. Wang
,
J. Li
,
K. Liang
,
M. Zhang
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades
As Large Language Models (LLMs) are frequently updated, LoRA weights trained on earlier versions quickly become obsolete. The …
Y. Li
,
F. Meng
,
M. Zhang
,
S. Zhu
,
S. Wang
,
M. Xu
PDF
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
Common Neighbors (CNs) and their higher-order variants are important pairwise features widely used in state-of-the-art link prediction …
J. Wang
,
X. Wang
,
M. Zhang
PDF
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
Current benchmarks for evaluating the reasoning capabilities of Large Language Models (LLMs) face significant limitations: task …
S. Qiu
,
S. Guo
,
Z. Song
,
.Et El
,
M. Zhang
,
H. X. Zhu
PDF
Project
TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup
In this paper, we present TransMLA, a framework that seamlessly converts any GQA-based pre-trained model into an MLA-based model. Our …
F. Meng
,
P. Tang
,
Z. Yao
,
X. Sun
,
M. Zhang
PDF
Code
Reconsidering the Performance of GAE in Link Prediction
Recent advancements in graph neural networks (GNNs) for link prediction have introduced sophisticated training techniques and model …
W. Ma
,
Y. Wang
,
X. Wang
,
M. Zhang
PDF
Code
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
Decoder-only models generate tokens autoregressively by caching key/value vectors, but as the cache grows, inference becomes …
F. Meng
,
P. Tang
,
F. Jiang
,
M. Zhang
PDF
Geometric Representation Condition Improves Equivariant Molecule Generation
Recent advances in molecular generative models have demonstrated great promise for accelerating scientific discovery, particularly in …
Z. Li
,
C. Zhou
,
X. Wang
,
X. Peng
,
M. Zhang
PDF
Code
Griffin: Towards a Graph-Centric Relational Database Foundation Model
We introduce Griffin, the first foundation model designed specifically for Relational Databases (RDBs). Unlike previous smaller models …
Y. Wang
,
X. Wang
,
Q. Gan
,
M. Wang
,
Q. Yang
,
D. Wipf
,
M. Zhang
PDF
Code
»
Cite
×