emi-8

Follow

Emi emi-8

Follow

AI Safety & Quality Specialist | LLM Evaluator | Building JCA-Bench | Contact: emi.e.research@gmail.com

3 followers · 3 following

emi-8/README.md

Exploring AI behavior through evaluation, red teaming, and cultural benchmarking.

Background: Google Cybersecurity certified | Lakera Gandalf AI adversarial testing (2nd place in model league)

Current Projects

🔬 Competing Circuits Across Languages — Safety vs. Instruction-Following Dynamics in Multilingual LLMs (WIP)

Interests

Mechanistic Interpretability
Multilingual AI Safety
Red Teaming

Pinned Loading

competing-circuits-multilingual competing-circuits-multilingual Public

Mechanistic interpretability study on safety circuits across EN/JA languages

HTML
prompt-playground prompt-playground Public

A playground to test prompt behavior, chaining logic, and model reactions.

Python
htb-writeups htb-writeups Public

My HTB and THM writeups.