SproutNan

Follow

Ruixuan SproutNan

Follow

work hard, play hard

31 followers · 12 following

HKUST
Sai Kung, Hong Kong
http://sproutnan.github.io

Achievements

Achievements

Highlights

Pro

SproutNan/README.md

👋 Hi, I’m @SproutNan
👀 I’m interested in AI interpretability and Societal AI.
🌱 I’m currently learning Concept-based AI interpretation methods.

Pinned Loading

AI-Safety_SCAV AI-Safety_SCAV Public

This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"

Jupyter Notebook 47 9
AI-Safety_Benchmark AI-Safety_Benchmark Public

The official repository for guided jailbreak benchmark

Python 28 1
LLM_layerclassifier LLM_layerclassifier Public

Jupyter Notebook 1
BBone_Decom BBone_Decom Public

将《植物大战僵尸 Online》的 BBone 格式动画文件解码成 JSON 的工具

Python 4 3