mengfanxu

fxmeng

https://fxmeng.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

upvoted a paper 2 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

updated a model about 1 month ago

fxmeng/TransMLA-llama3-8b-32k

Organizations

None yet

Collections 9

Papers 4

arxiv:2508.15881

arxiv:2502.07864

arxiv:2411.17426

arxiv:2404.02948

Models 53

fxmeng/TransMLA-llama3-8b-32k

8B • Updated Nov 24, 2025 • 3

fxmeng/TransMLA-llama3-8b-8k

8B • Updated Nov 24, 2025 • 34

fxmeng/PiSSA-llama-7b-commonsense-148k

7B • Updated Feb 13, 2025 • 6

fxmeng/PiSSA-Llama-3-8b-commonsense-148k

8B • Updated Feb 13, 2025 • 4

fxmeng/PiSSA-Llama-2-7b-commonsense-148k

7B • Updated Feb 13, 2025 • 7

fxmeng/PiSSA-llama-13b-commonsense-148k

13B • Updated Feb 13, 2025 • 8

fxmeng/CLOVER-llama-3-8b-commonsense-148k

8B • Updated Feb 2, 2025 • 3

fxmeng/CLOVER-llama-2-7b-commonsense-148k

7B • Updated Feb 2, 2025 • 4

fxmeng/CLOVER-llama-13b-commonsense-148k

13B • Updated Feb 2, 2025 • 4

fxmeng/CLOVER-llama-7b-commonsense-148k

7B • Updated Feb 2, 2025 • 5

Datasets 12

fxmeng/transmla_pretrain_100m_tokens

Viewer • Updated Jul 5, 2025 • 100k • 35

fxmeng/transmla_pretrain_1B_tokens

Viewer • Updated Jul 5, 2025 • 1.14M • 113

fxmeng/transmla_pretrain_6B_tokens

Viewer • Updated Jul 5, 2025 • 5.94M • 230

fxmeng/pissa-dataset

Viewer • Updated Jan 8, 2025 • 844k • 1.53k • 3

fxmeng/big-bench-hard-continue-finetuning

Viewer • Updated Dec 19, 2024 • 10.3k • 92 • 1

fxmeng/commonsense_filtered

Viewer • Updated Dec 11, 2024 • 170k • 211 • 1

fxmeng/MetaMath-GSM240K

Viewer • Updated Nov 14, 2024 • 240k • 26 • 1

fxmeng/MetaMath-MATH155K

Viewer • Updated Nov 14, 2024 • 155k • 20

fxmeng/CodeFeedback-Python105K

Viewer • Updated Nov 14, 2024 • 105k • 60 • 6

fxmeng/llava_finetune_336x336

Preview • Updated Apr 26, 2024 • 13
