Jim Lai
grimjim
AI & ML interests
Experimenting primarily with 7B-12B parameter text completion models. Not all models are intended for direct end use; some are meant for research and/or educational purposes.
Recent Contributions: stabilized refusal direction ablation via Gram-Schmidt orthonormalization and norm-preserving interventions; confirmed reasoning transfer via model merging.
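For illustration only, a minimal PyTorch sketch of what refusal-direction ablation with Gram-Schmidt orthonormalization and norm-preserving rescaling could look like. The function names, shapes, and tolerances are assumptions, not the code used in those experiments.

```python
import torch

def gram_schmidt(directions: torch.Tensor) -> torch.Tensor:
    """Orthonormalize a set of candidate refusal directions (rows) via Gram-Schmidt.

    directions: (k, d_model). Returns an orthonormal basis of the spanned subspace.
    """
    basis = []
    for v in directions:
        for b in basis:
            v = v - (v @ b) * b          # remove components along earlier basis vectors
        norm = v.norm()
        if norm > 1e-8:                  # skip near-degenerate directions
            basis.append(v / norm)
    return torch.stack(basis)

def ablate_norm_preserving(h: torch.Tensor, basis: torch.Tensor) -> torch.Tensor:
    """Project out the refusal subspace from hidden states h, then restore the original norm.

    h: (..., d_model) hidden states; basis: (k, d_model) orthonormal refusal directions.
    """
    original_norm = h.norm(dim=-1, keepdim=True)
    projection = (h @ basis.T) @ basis   # component of h lying in the refusal subspace
    h_ablated = h - projection
    return h_ablated * original_norm / h_ablated.norm(dim=-1, keepdim=True).clamp_min(1e-8)
```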
Recent Activity
updated a model 1 day ago: grimjim/Equatorium-v1-12B
published a model 1 day ago: grimjim/Equatorium-v1-12B
posted an update 2 days ago:
After tinkering with Gemma Scope 2, I now have a mechanistic explanation of why Winsorization was as effective as it was in my ablation experiments on Gemma 3 12B Instruct. In short, the activation for the BOS token overwhelms everything else. Gemma Scope 2 deliberately did not train on the BOS token. Winsorization capped the magnitude of the BOS token activation, allowing the activations of other tokens to be compared.
https://huggingface.co/google/gemma-scope-2-12b-it
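For illustration, a minimal PyTorch sketch of the kind of winsorization described in the post: capping per-token activation norms at a percentile threshold so the outsized BOS activation no longer dominates. The function name, shapes, and the percentile cap are assumptions, not the exact procedure used in the experiments.

```python
import torch

def winsorize_activations(activations: torch.Tensor, percentile: float = 0.99) -> torch.Tensor:
    """Cap per-token activation norms at a percentile threshold.

    activations: (seq_len, d_model) residual-stream activations for one prompt.
    The BOS token (position 0) typically has a far larger norm than the rest;
    capping it lets the remaining tokens be compared on a common scale.
    """
    norms = activations.norm(dim=-1)             # (seq_len,) per-token norms
    cap = torch.quantile(norms, percentile)      # winsorization threshold
    scale = torch.clamp(cap / norms, max=1.0)    # shrink only the outliers
    return activations * scale.unsqueeze(-1)
```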