https://github.com/jzhang38/LongMamba
Zhang Peiyuan
PY007
AI & ML interests
None yet
Organizations
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 114 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 72 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 5 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 358
LongMamba
https://github.com/jzhang38/LongMamba
EasyContext
https://github.com/jzhang38/EasyContext
-
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M
Viewer • Updated • 5.04k • 114 • 2 -
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 3.94k • 72 • 1 -
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 5 • 4 -
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K
Viewer • Updated • 37.9k • 358
models 5
PY007/slimpajama_LLAMA3_tokenized_chunk_512K_debug
Updated
PY007/vicuna-7b-v1.5
Text Generation • 7B • Updated • 7
PY007/EasyContext-256K-danube2-1.8b
Text Generation • 2B • Updated • 15 • 5
PY007/EasyContext-1M-Llama-2-7B
Text Generation • 7B • Updated • 5 • 4
PY007/LongMamba_16384_bs128_step400
Updated • 29 • 5
datasets 27
PY007/Attn-QAT
Viewer • Updated • 3 • 93
PY007/bf16_videos
Viewer • Updated • 3 • 122
PY007/nvfp4_videos
Viewer • Updated • 3 • 96
PY007/sage3_videos
Viewer • Updated • 3 • 179
PY007/crush-smol
Viewer • Updated • 4 • 204
PY007/slimpajama_Qwen2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.79k • 338
PY007/slimpajama_Yi1.5_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.48k • 65
PY007/slimpajama_llama2_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 7.79k • 107
PY007/slimpajama_LLAMA3_tokenized_upsample_4096_chunk_256K
Viewer • Updated • 6.64k • 55
PY007/wild_chat_llama3_template_tokenized_merged_1M
Viewer • Updated • 1.27k • 101