Llama2-Self-Aligned-Backtranslation: Reproducing "Self-Alignment with Instruction Backtranslation" (ACL 2023)
This repository provides a complete implementation of the paper Self-Alignment with Instruction Backtranslation, using Llama2-7B as the base model. The project covers backward model training, self-augmentation, quality curation with LLMs, and instruction fine-tuning, all made memory-efficient with LoRA.
- Goal: Train a model to predict instructions (x) from responses (y).
- Dataset: OpenAssistant-Guanaco training set (seed data).
- Techniques: LoRA fine-tuning with 4-bit quantization (memory-efficient).
- Model: llama2-7b-backward-model
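The core idea of the backward model is to invert the usual instruction-tuning direction: the response becomes the input and the instruction becomes the training target. A minimal sketch of that data construction is below; the prompt template and field names are illustrative assumptions, not the repo's exact format.

```python
# Sketch: turn a seed (instruction, response) pair into a backward-model
# training example, so the model learns p(instruction | response).
# NOTE: the template and dict keys here are hypothetical.

def make_backward_example(instruction: str, response: str) -> dict:
    """Format one seed pair with the response as input and the instruction as target."""
    prompt = (
        "Below is a response. Write the instruction it answers.\n\n"
        f"### Response:\n{response}\n\n### Instruction:\n"
    )
    return {"prompt": prompt, "completion": instruction}

example = make_backward_example(
    "Explain what LoRA is.",
    "LoRA adds small trainable low-rank matrices to frozen weights.",
)
```

Training on `prompt` → `completion` pairs like this is what lets the backward model later generate candidate instructions for unlabeled completions.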
- Process:
- Randomly sample 150 single-turn completions from the LIMA dataset.
- Generate instructions for these completions using the backward model.
- Filter out multi-turn dialogues (i.e., conversations with more than two turns), keeping only single-turn pairs.
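The single-turn filter in the last step can be sketched as a simple turn-count check. The `conversations` field name and list-of-turns layout are assumptions about how the LIMA records are stored.

```python
# Hypothetical sketch of the multi-turn filter: keep only records whose
# dialogue is a single user message plus a single reply (<= 2 turns).

def is_single_turn(record: dict) -> bool:
    """True if the record holds at most one user turn and one assistant turn."""
    return len(record.get("conversations", [])) <= 2

samples = [
    {"conversations": ["How do I sort a list in Python?", "Use sorted(my_list)."]},
    {"conversations": ["Hi", "Hello!", "Tell me more.", "Sure, here is more detail."]},
]
kept = [s for s in samples if is_single_turn(s)]
```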
- Used meta/llama-7b-chat-hf to score instruction-response pairs (1-5 scale).
- Selected high-quality examples (score ≥4) and discarded low-quality ones (score ≤2).
- Curated Dataset: backtranslated-lima-cleaned
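The curation step amounts to extracting a 1-5 rating from the scoring model's free-text reply and keeping pairs rated at or above the threshold. A hedged sketch follows; the "Score: N" output format the regex expects is an assumption about the scoring prompt, not the repo's exact convention.

```python
import re
from typing import Optional

def parse_score(model_output: str) -> Optional[int]:
    """Pull a 1-5 rating out of the scoring model's reply (assumes 'Score: N' style)."""
    match = re.search(r"[Ss]core\s*[:=]?\s*([1-5])", model_output)
    return int(match.group(1)) if match else None

def curate(scored_pairs, keep_threshold: int = 4):
    """Keep only instruction-response pairs rated at or above the threshold."""
    kept = []
    for pair, output in scored_pairs:
        score = parse_score(output)
        if score is not None and score >= keep_threshold:
            kept.append(pair)
    return kept

demo = [
    (("What is LoRA?", "A low-rank fine-tuning method."), "Score: 5"),
    (("asdf", "???"), "Score: 2"),
]
```

With these demo pairs, `curate(demo)` keeps only the pair scored 5, mirroring the ≥4 / ≤2 split described above.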
- Fine-tuned Llama2-7B on the curated dataset with LoRA, achieving better instruction-following capabilities.
- Final Model: llama2-instruction-aligned
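The memory-efficient fine-tuning setup described above can be sketched with `peft` and `transformers` config objects. The hyperparameter values below are illustrative guesses, not the repo's exact settings.

```python
import torch
from peft import LoraConfig
from transformers import BitsAndBytesConfig

# Illustrative (not the repo's exact) memory-efficient fine-tuning configs.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                    # quantize frozen base weights to 4-bit
    bnb_4bit_quant_type="nf4",            # NF4 quantization scheme
    bnb_4bit_compute_dtype=torch.bfloat16,
)

lora_config = LoraConfig(
    r=16,                                 # low-rank adapter dimension (assumed)
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # Llama 2 attention projections
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
```

Both configs would then be passed to `AutoModelForCausalLM.from_pretrained(..., quantization_config=bnb_config)` and `peft.get_peft_model(model, lora_config)` respectively.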
Install dependencies via:

```shell
pip install -r requirements.txt
```