Skip to content
View gsmoon97's full-sized avatar

Highlights

  • Pro

Block or report gsmoon97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gsmoon97/README.md

Hi there πŸ‘‹ I'm Geonsik Moon

MSCS @ Columbia University | ML Engineer | LLM Training & Evaluation

gsmoon97 website google-scholar orcid

🎯 About Me

I specialize in Large Language Models (LLMs) and Natural Language Processing (NLP), with a focus on advancing the state-of-the-art in downstream tasks such as semantic understanding, timeline summarization, and grammatical error correction. My work bridges cutting-edge research (4 ACL publications) and production-grade systems (ByteDance, Apple), making AI accessible and practical. Last but not least, my first name "Geonsik" is pronounced as "Gun-Shik" or /kʌn.Ι•ik/. You can also just call me "GS" πŸ™‹πŸ»β€β™‚οΈ

πŸš€ Core AI/ML Expertise

Large Language Models

  • Fine-tuning & Optimization: LoRA-based PEFT, 4-bit quantization, hyperparameter tuning with W&B
  • Model Deployment: vLLM serving, LangChain pipelines, ChromaDB integration, AWS Bedrock inference
  • Models: Mistral, Llama 2/3, FLAN-T5, GPT family, OpenAI GPT-OSS-20B
  • Structured Outputs: Using instructor library with Pydantic models for reliable LLM responses
  • Research: Incremental clustering algorithms using LLM-based pairwise classification

Natural Language Processing

  • Grammatical Error Correction (GEC): Sequence-to-sequence & sequence tagging approaches
  • Timeline Summarization (TLS): Event detection, clustering, and narrative construction
  • Semantic Understanding: Word Sense Disambiguation (WSD), Words-in-Context (WiC)
  • Email Classification: Topic-based email classification with RAG-enriched semantic search
  • Transfer Learning: Encoder-only vs. decoder-only architectures for semantic tasks

Production ML Systems

  • Scalable Web Applications: Flask, Streamlit, Bootstrap, Docker containerization, LAMP stack
  • Microservices Architecture: GEC system with separate API and web interface modules, email processing pipelines
  • Model Serving: Production-grade deployment of transformer models and LLMs for real-time inference
  • RAG Pipelines: Retrieval-Augmented Generation with vector embeddings for semantic search

πŸ’« Featured Projects

Timeline Summarization with LLMs (ACL 2024) [Code | Paper]

  • Novel approach leveraging LLMs for incremental event clustering and timeline construction from text streams. Outperformed SOTA on 4 TLS benchmarks.
  • Tech Stack: PyTorch vLLM Llama-2-13B LangChain ChromaDB

Semantic Understanding with LLMs (ACL 2024) [Code | Paper]

  • Comprehensive framework demonstrating encoder-only models outperform decoder-only LLMs on word meaning comprehension tasks.
  • Tech Stack: PyTorch HuggingFace Transformers LoRA PEFT WandB

Email Prime: AI-Powered Email Classification & Summarization [Code | Live Demo]

  • End-to-end email processing pipeline with Streamlit web UI for Gmail integration, intelligent topic classification using AWS Bedrock LLMs, and AI-generated email thread summaries. Features incremental processing, LLM-powered topic attribute generation, RAG-enriched classification with semantic search, and complete lifecycle management (create, view, delete projects).
  • Tech Stack: Python Streamlit AWS Bedrock OpenAI GPT-OSS-20B instructor Pydantic Gmail API FAISS Amazon Titan Embeddings ChromaDB LangChain

πŸ—‚οΈ Side Projects

LLM Agent Evaluation [Code]

  • Research toolkit for analyzing LLM agent trajectories on software engineering tasks.
  • Tech Stack: Jupyter Python Agent Frameworks

Algorithm Practice [Code]

  • Self-contained archive of LeetCode solutions demonstrating strong algorithmic foundations.
  • Tech Stack: Python Data Structures Algorithms

πŸ› οΈ Technical Skills

Programming Languages

python cplusplus golang java javascript

Deep Learning & AI Frameworks

pytorch tensorflow keras jax

LLM & NLP Ecosystem

huggingface openai anthropic bedrock ibm langchain llamaindex vllm spacy nltk wandb

Data Science & Analytics

pandas numpy matplotlib scikit-learn

Web Development

react django flask streamlit fastapi nodejs bootstrap html5 css3

Cloud & Distributed Computing

aws googlecloud azure docker kubernetes

Data Infrastructure & Processing

mysql postgresql mongodb redis chromadb pinecone spark hadoop

Tools & Version Control

git linux

πŸ“š Publications

  1. From Moments to Milestones: Incremental Timeline Summarization Leveraging Large Language Models
    Qisheng Hu, Geonsik Moon, Hwee Tou Ng
    ACL 2024 (Main Conference) | [Code | Paper]

  2. Are Decoder-Only Language Models Better than Encoder-Only Language Models in Understanding Word Meaning?
    Muhammad Qorib, Geonsik Moon, Hwee Tou Ng
    ACL 2024 (Findings) | [Code | Paper]

  3. ALLECS: A Lightweight Language Error Correction System
    Muhammad Reza Qorib, Geonsik Moon, Hwee Tou Ng
    EACL 2023 (System Demonstrations) | [Code | Paper]

  4. WAMP: Writing, Annotation, and Marking Platform
    Geonsik Moon, Muhammad Reza Qorib, Daniel Dahlmeier, Hwee Tou Ng
    IJCNLP-AACL 2023 (System Demonstrations) | [Code | Paper]

πŸ“« Connect With Me

Pinned Loading

  1. LLM-TLS LLM-TLS Public

    Forked from nusnlp/LLM-TLS

    Repo for ACL2024 paper "From Moments to Milestones: Incremental Timeline Summarization Leveraging Large Language Models".

    Python

  2. llm-semantic-understanding llm-semantic-understanding Public

    A comprehensive framework for fine-tuning and evaluating Large Language Models on semantic understanding tasks (WSD & WiC) with LoRA

    Python

  3. Amazon-Bedrock-Innovation-Challenge/email-prime Amazon-Bedrock-Innovation-Challenge/email-prime Public

    AWS Bedrock-powered email intelligence system with RAG-enhanced classification and summarization, built for the Columbia Engineering X Amazon Bedrock Innovation Challenge

    Python

  4. Columbia-F1-Robotics/f1_robotics_racing_sim Columbia-F1-Robotics/f1_robotics_racing_sim Public

    Vision-based autonomous racing system comparing PPO, DQN, and GAIL with custom reward shaping across CarRacing-v3 and TORCS simulators

    Python

  5. nusnlp/ALLECS nusnlp/ALLECS Public

    The official code of ALLECS: A Lightweight Language Error Correction System

    Python 12 1

  6. nusnlp/WAMP nusnlp/WAMP Public

    The official code of WAMP: Writing, Annotation, and Marking Platform

    Rich Text Format 3 2