- 👋 Hi, I’m @SproutNan
- 👀 I’m interested in AI interpretability and Societal AI.
- 🌱 I’m currently learning concept-based AI interpretability methods.
Pinned
- AI-Safety_SCAV (Public): This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"
- AI-Safety_Benchmark (Public): The official repository for guided jailbreak benchmark

