|
| 1 | +[中文主页](./README.zh-CN.md) |
| 2 | + |
| 3 | +# DataFlow-Eval |
| 4 | + |
| 5 | +<p align="center"> |
| 6 | + <img src="./static/images/Face.png"> |
| 7 | +</p> |
| 8 | +<a href="https://opensource.org/license/apache-2-0" target="_blank"> |
| 9 | + <img alt="License: apache-2-0" src="https://img.shields.io/github/license/saltstack/salt" /> |
| 10 | +</a> |
| 11 | +<a href="https://github.com/GAIR-NLP/ProX" target="_blank"> |
| 12 | + <img alt="GitHub Stars" src="https://img.shields.io/github/stars/Open-DataFlow/Open-DataFlow-Eval?style=social" /> |
| 13 | +</a> |
| 14 | +<a href="https://github.com/GAIR-NLP/ProX/issues" target="_blank"> |
| 15 | + <img alt="Open Issues" src="https://img.shields.io/github/issues-raw/Open-DataFlow/Open-DataFlow-Eval" /> |
| 16 | +</a> |
| 17 | + |
| 18 | +DataFlow-Eval is a data evaluation system to evaluate data quality from multiple dimensions. We mainly support SOTA algorithms within academic papers with strong theoretical support. |
| 19 | + |
| 20 | +We now support text, image, video and multimodality data types. |
| 21 | + |
| 22 | +## Table of Contents |
| 23 | +- [DataFlow-Eval](#dataflow-eval) |
| 24 | + - [Table of Contents](#table-of-contents) |
| 25 | + - [Module and Modality Support](#module-and-modality-support) |
| 26 | + - [🔥 News](#news) |
| 27 | + - [Installation](#installation) |
| 28 | + - [Quick Start](#quick-start) |
| 29 | + - [Text](#text) |
| 30 | + - [Image](#image) |
| 31 | + - [Video](#video) |
| 32 | + - [Data Evaluation Documentation](#data-evaluation-documentation) |
| 33 | + - [Text Documentation](#text-documentation) |
| 34 | + - [Image Documentation](#image-documentation) |
| 35 | + - [Video Documentation](#video-documentation) |
| 36 | + - [Data Evaluation Algorithms](#data-evaluation-algorithms) |
| 37 | + - [Text Algorithms](#text-algorithms) |
| 38 | + - [Image Algorithms](#image-algorithms) |
| 39 | + - [Video Algorithms](#video-algorithms) |
| 40 | + - [👋 Awesome Data Evaluation](#awesome-data-evaluation) |
| 41 | + |
| 42 | +## Module and Modality Support |
| 43 | + |
| 44 | +| Module\Modality | Text | Image | Video | Image-Text Pair | Video-Text Pair | |
| 45 | +| ------------------- | ---- | ----- | ----- | --------------- | --------------- | |
| 46 | +| **Data Evaluation** | ✅ | ✅ | ✅ | ✅ | ✅ | |
| 47 | + |
| 48 | +## 🔥 News |
| 49 | + |
| 50 | +- [2024-10-14] 🎉 We summarize data evaluation papers and codes in [👋 Awesome Data Evaluation](./Awesome_Data_Evaluation.md) |
| 51 | +- [2024-10-14] 🎉 Our first data-centric evaluation system is now open source. |
| 52 | + |
| 53 | +## Installation |
| 54 | + |
| 55 | + |
| 56 | +For environment setup, please using the following commands👇 |
| 57 | + |
| 58 | +``` |
| 59 | +conda create -n dataflow python=3.9 |
| 60 | +conda activate dataflow |
| 61 | +pip install -e . |
| 62 | +``` |
| 63 | + |
| 64 | +If you want to evaluate each modality of data, please use the following commands: |
| 65 | +<details> |
| 66 | +<summary> |
| 67 | +<b>text data eval</b> |
| 68 | +</summary> |
| 69 | +<p> |
| 70 | + |
| 71 | +```bash |
| 72 | +pip install -e .[text] |
| 73 | +pip install flash-attn==2.6.3 |
| 74 | +python -m spacy download en_core_web_sm |
| 75 | +``` |
| 76 | + |
| 77 | +</p> |
| 78 | +</details> |
| 79 | + |
| 80 | +<details> |
| 81 | +<summary> |
| 82 | +<b>image data eval</b> |
| 83 | +</summary> |
| 84 | +<p> |
| 85 | + |
| 86 | +```bash |
| 87 | +pip install -e .[image] |
| 88 | +pip install pyiqa==0.1.12 |
| 89 | +pip install transformers==4.44.2 |
| 90 | +``` |
| 91 | + |
| 92 | +</p> |
| 93 | +</details> |
| 94 | + |
| 95 | + |
| 96 | +<details> |
| 97 | +<summary> |
| 98 | +<b>video data eval</b> |
| 99 | +</summary> |
| 100 | +<p> |
| 101 | + |
| 102 | +```bash |
| 103 | +pip install -e .[video] |
| 104 | +``` |
| 105 | +When evaluating video-caption data, please run the following command to install modified CLIP for EMScore: |
| 106 | +``` |
| 107 | +pip install git+https://github.com/MOLYHECI/CLIP.git |
| 108 | +``` |
| 109 | + |
| 110 | +</p> |
| 111 | +</details> |
| 112 | + |
| 113 | +<details> |
| 114 | +<summary> |
| 115 | +<b>All dependencies</b> |
| 116 | +</summary> |
| 117 | +<p> |
| 118 | + |
| 119 | +```bash |
| 120 | +pip install -e .[all] |
| 121 | +pip install flash-attn==2.6.3 |
| 122 | +pip install pyiqa==0.1.12 |
| 123 | +pip install transformers==4.44.2 |
| 124 | +``` |
| 125 | + |
| 126 | +</p> |
| 127 | +</details> |
| 128 | + |
| 129 | +Please refer to Data Evaluation Documentation for config file usage. Use the following command to run with the config file |
| 130 | + |
| 131 | +``` |
| 132 | +python main.py --config [your config file] |
| 133 | +``` |
| 134 | +<p align="center"> |
| 135 | + <img src="./static/images/example_1.png"> |
| 136 | +</p> |
| 137 | + |
| 138 | +## 🚀 Quick Start |
| 139 | +### Text |
| 140 | +- [Text Evaluation Demo](./demos/text_eval/text_eval_example.ipynb) |
| 141 | +- [文本评估示例](./demos/text_eval/text_eval_example.zh-CN.ipynb) |
| 142 | +### Image |
| 143 | +- [Image Evaluation Demo](./demos/image_eval/image_eval_example.ipynb) |
| 144 | +- [图片评估示例](./demos/image_eval/image_eval_example.zh-CN.ipynb) |
| 145 | +### Video |
| 146 | +- [Video Evaluation Demo](./demos/video_eval/video_eval_example.ipynb) |
| 147 | +- [视频评估示例](./demos/video_eval/video_eval_example.zh-CN.ipynb) |
| 148 | + |
| 149 | +## Data Evaluation Documentation |
| 150 | + |
| 151 | +For the usage of evaluation, please refer to the following documents👇 |
| 152 | + |
| 153 | +### Text Documentation |
| 154 | +- [Text Data Evaluation User Documentation (English)](./dataflow/Eval/Text/README.md) |
| 155 | +- [文本数据评估使用文档 (中文)](./dataflow/Eval/Text/README.zh-CN.md) |
| 156 | + |
| 157 | +### Image Documentation |
| 158 | +- [Image Data Evaluation User Documentation (English)](./dataflow/Eval/image/README.md) |
| 159 | +- [图像数据评估使用文档 (中文)](./dataflow/Eval/image/README.zh-CN.md) |
| 160 | + |
| 161 | +### Video Documentation |
| 162 | +- [Video Data Evaluation User Documentation (English)](./dataflow/Eval/video/README.md) |
| 163 | +- [视频数据评估使用文档 (中文)](./dataflow/Eval/video/README.zh-CN.md) |
| 164 | + |
| 165 | +## Data Evaluation Algorithms |
| 166 | + |
| 167 | +We summarize the SOTA algorithms from academic papers for data evaluation. |
| 168 | +### Text Algorithms |
| 169 | +- [Text Evaluation Algorithm Document (English)](./docs/text_metrics.md) |
| 170 | +- [文本算法介绍文档 (中文)](./docs/text_metrics.zh-CN.md) |
| 171 | + |
| 172 | +### Image Algorithms |
| 173 | +- [Image Evaluation Algorithm Document (English)](./docs/image_metrics.md) |
| 174 | +- [图像数据评估使用文档 (中文)](./docs/image_metrics.zh-CN.md) |
| 175 | + |
| 176 | +### Video Algorithms |
| 177 | +- [Video Evaluation Algorithm Document (English)](./docs/video_metrics.md) |
| 178 | +- [视频数据评估使用文档 (中文)](./docs/video_metrics.zh-CN.md) |
| 179 | + |
| 180 | +## Awesome Data Evaluation |
| 181 | +- [👋 Awesome Data Evaluation](./Awesome_Data_Evaluation.md) |
0 commit comments