Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jingwei Xu's picture
16 7 68

Jingwei Xu

ParagonLight
OrionChat's profile picture shtefcs's profile picture
·
https://njudeepengine.github.io/jingweixu/
  • paragonlight

AI & ML interests

None yet

Organizations

NJUDeepEngine's profile picture

upvoted 2 papers 2 months ago

Long-Context Attention Benchmark: From Kernel Efficiency to Distributed Context Parallelism

Paper • 2510.17896 • Published Oct 19 • 4

Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference

Paper • 2510.18413 • Published Oct 21 • 4
upvoted 2 papers 7 months ago

LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation

Paper • 2505.12031 • Published May 17 • 2

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

Paper • 2311.12351 • Published Nov 21, 2023 • 5
upvoted 2 papers about 1 year ago

MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

Paper • 2405.13053 • Published May 19, 2024 • 1

Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines

Paper • 2410.07896 • Published Oct 10, 2024 • 2
upvoted a collection over 1 year ago

🍃 MINT-1T

Collection
Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 14 items • Updated Oct 22 • 64
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs