Skip to content

ye-kyaw-thu/wfst_nlp_tutorials

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

87 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

wfst_nlp_tutorials

This repository contains materials from a 5-hour WFST tutorial I delivered to LU Lab interns on 13 July 2025. The focus is on leveraging the power of Weighted Finite State Transducers (WFSTs) for core NLP tasks including word segmentation, POS tagging, and machine translation. Whether you’re new to WFSTs or looking for practical implementations, these tutorials provide a solid foundation for understanding their applications in NLP.

Lecture Slide

Notebooks

1.WFST_Word_Segmentation_Small_Corpus.ipynb
2.WFST_POS_Tagging_Small_Corpus.ipynb
3.WFST_MT_Small_Corpus.ipynb
4.WFST_Word_Segmentation.ipynb
5.WFST_POS_Tagging.ipynb
6.WFST_MT.ipynb

License

The Bash shell scripts, Python code, and Jupyter notebooks in this WFST-NLP tutorial repository are licensed under the MIT License.

However, the datasets used in these tutorials follow their original licenses:

  • Word Segmentation & POS Tagging Tutorial: Uses myPOS (Version 3.0), licensed as per its original source.
  • Machine Translation Tutorial: Uses the Myanmar-Rakhine Parallel Corpus (full version not yet publicly released).
    • For research purposes, I’ve included aligned phrase pairs (Myanmar-Rakhine) generated using the anymalign alignment toolkit.

Citation

If you have used Jupyter Notebooks for teaching or R&D, please cite them as follows:
(wfst_nlp_tutorials ကို သုံးဖြစ်ကြရင် အောက်ပါ citation လုပ်ပေးပါ။ ကျေးဇူးပါ။)

@misc{wfst_nlp_tutorials_2025,
  author       = {Ye Kyaw Thu},
  title        = {wfst_nlp_tutorials},
  month        = {7},
  year         = {2025},
  url          = {https://github.com/ye-kyaw-thu/wfst_nlp_tutorials},
  note         = {Accessed Date: yyyy-mm-dd},
  institution  = {LU Lab., Myanmar}
}

About

WFST for NLP: Tutorials on word segmentation, POS tagging, and machine translation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published