
Conversation

@shijinpjlab (Collaborator)

No description provided.

pekopoke and others added 30 commits October 29, 2025 17:49
* 📚 Auto-update metrics documentation

* add OCR prompt

* 📚 Auto-update metrics documentation

* fix pylint

* Update document_parsing_quality_ocr_train.py

* add new ocr prompt

---------

Co-authored-by: GitHub Action <[email protected]>
Co-authored-by: quyuan <[email protected]>
feat: add 5 RAG eval metrics
* feat: temp

* feat: move merge_result_info

* feat: rules now supported

* feat: fix bug

* feat: temporary prompt code

* feat: change field from list to dict; rule path now runs end-to-end

* feat: rename to map_data

* feat: merge prompt and llm

* feat: batch merging

* feat: delete evaldata

* feat: rename to evalpipline

* feat: adjust map_data

* feat: merge evaluate_rule and evaluate_prompt

* feat: concurrency v3

* feat: merge evaluate_single_data and evaluate_by_type

* feat: merge execute and evaluate

* feat: fix config-overwrite bug caused by concurrency

* feat: adjust code placement

* feat: update local files for the new result_info and the modelres error_type (summary module still to be updated)

* feat: summary module

* feat: change error_type values from a list of reasons to a dict with two keys: metric and reason

* feat: update

* feat: add ResTypeInfo class

* feat: update return values in rule_common.py

* feat: update return values in the 4 rule files

* feat: update llm (except type, which is a list)

* feat: move files

* feat: fix imports after the file moves

* feat: remove one nesting level from error_type

* feat: judgment logic for result_save.good

* feat: update

* feat: update res in rule_common.py, add label

* feat: update res, add label

* feat: update res, add label

* feat: fix lint

* feat: 4 kinds of base convertor

* feat: plaintext case

* feat: plaintext save

* feat: fix json

* feat: fix jsonl

* feat: fix listjson

* feat: fix hf_plaintext.json

* feat: fix hf_json

* feat: fix hf_jsonl

* feat: fix hf_listjson

* feat: fix bug with abnormal multi-rule results

* feat: fix bug with abnormal multi-rule results

* feat: fix custom config rule

* feat: fix test_local.py

* feat: fix test_local.py

* feat: fix test_continue.py

* feat: fix test_write.py, repair complex rules

* feat: fix test_rule_common.py

* feat: ImageConverter

* feat: fix lint

* feat: handle the case where label is an array

* feat: folder naming

* feat: update examples

* feat: remove special prompts

* feat: remove prompt classes

* feat: fix lint

* feat: optimize ModelRes assignment; remove prompt-related code from Model

* feat: fix lint

* feat: ignore

* feat: ModelRes with fixed fields

* update res b_box overlap and visual rule

* update res b_box overlap and visual rule

* feat: complete evaluate for spark

* feat: update summarize for spark

* feat: fix bug

* feat: update model

* feat: fix lint

* feat: TestModelRes

* feat: special scenario for chupei

* feat: fix bug

* update res b_box overlap and visual rule

* feat: delete old convertor

* feat: optimize local; change prompt to llm

* feat: add sql source

* feat: fix lint

* feat: LLMHtmlExtractCompareEn

* feat: fix lint

* feat: change name

* feat: remove the fields feature from DatasetArgs

* feat: fix bug plaintext

* feat: fix lint

* feat: test ignore rag

---------

Co-authored-by: pekopoke <[email protected]>
e06084 and others added 27 commits December 23, 2025 18:19
* feat: add Instruction Quality Evaluation

* feat: add examples in metrics

* 📚 Auto-update metrics documentation

---------

Co-authored-by: GitHub Action <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* feat: use_browser

* feat: fix gemini

* feat: lint
…-tests

Feat/update all examples and tests
* feat: init agent&tool architecture

* feat: agent&tool docs/tests/examples

* fix bugs
docs: update wechat doc
@gemini-code-assist (Contributor)

Summary of Changes

Hello @shijinpjlab, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request marks a significant architectural and functional upgrade, moving to version 2.0.0. The core evaluation engine has been redesigned for greater flexibility, allowing users to define complex evaluation pipelines with dynamic field mappings and a rich set of new LLM-based evaluators. This release also broadens data source compatibility and introduces an agent framework for advanced, tool-augmented evaluations, all while enhancing the user interface for better data visualization and interaction.

Highlights

  • Core Evaluation Architecture Refactor: The evaluation pipeline configuration has been completely overhauled, moving from a rigid object structure to a flexible array of evaluation pipelines (EvalPipline). The new structure allows defining multiple evaluation groups, each with its own field mappings and a list of evaluators (rules or LLMs), significantly enhancing configurability and extensibility (see the config sketch after this list).
  • Introduction of Agent Framework: A new agent framework enables LLM evaluators to perform multi-step reasoning and use external tools. It includes a BaseAgent class and concrete implementations such as AgentHallucination, which can run web searches for fact-checking, making evaluations more dynamic and robust (see the agent sketch after this list).
  • Expanded Data Source Support: The local data source now supports a wider range of file formats, including Excel (.xlsx, .xls), CSV, and Parquet, alongside the existing JSONL/JSON/TXT formats. Additionally, a new SQL data source allows direct streaming from databases such as PostgreSQL, MySQL, and SQLite (see the SQL-source sketch after this list).
  • Standardized Evaluation Result Format: A new EvalDetail Pydantic model standardizes the output of all evaluators (both rules and LLMs). It captures the metric name, status (pass/fail), score, labels, and reasons, replacing the previous ModelRes and simplifying result processing and aggregation (see the model sketch after this list).
  • Comprehensive Suite of New LLM Evaluators: A broad set of new LLM-based evaluators has been added across categories including RAG evaluation (Answer Relevancy, Context Precision, Recall, Relevancy, Faithfulness), SFT instruction quality (Clarity, Task Difficulty), Meta-rater dimensions (Professionalism, Readability, Reasoning, Cleanliness), minor-language detection, resume optimization, and VLM-based document parsing.
  • Enhanced UI and Gradio App Functionality: The Electron UI now dynamically displays evaluation results, with new filtering by JSONL file paths and improved pie-chart visualizations. The Gradio application has been significantly refactored to support the new flexible evaluation pipeline configuration and dynamic field mapping.
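
To make the new pipeline structure concrete, here is a minimal sketch of what such a config could look like. The key names (eval_pipeline, map_data, evaluator) and the evaluator names are assumptions inferred from the commit history and this summary, not a verified schema.

# Hypothetical config sketch; key and evaluator names are assumptions,
# not the confirmed Dingo 2.0 schema.
config = {
    "input_path": "data/sample.jsonl",
    "eval_pipeline": [
        {
            # map dataset columns onto the fields this group's evaluators read
            "map_data": {"content": "answer", "prompt": "question"},
            # mix rule-based and LLM-based evaluators in one group
            "evaluator": ["RuleColonEnd", "LLMAnswerRelevancy"],
        },
        {
            "map_data": {"content": "context"},
            "evaluator": ["RuleContentNull"],
        },
    ],
}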
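A rough sketch of how a tool-using evaluator might be shaped, assuming BaseAgent exposes a tools list and a run entry point; the actual class layout in the PR may differ.

class BaseAgent:
    """Drives an LLM through multi-step reasoning, optionally calling tools."""

    def __init__(self, tools=None):
        self.tools = tools or []

    def run(self, data):
        raise NotImplementedError


class AgentHallucination(BaseAgent):
    """Fact-checks a response with tool output (e.g. a web search) before scoring."""

    def run(self, data):
        # gather external evidence for the claim, e.g. search-engine snippets
        evidence = [tool(data["content"]) for tool in self.tools]
        # a real implementation would prompt the LLM with the response plus
        # the evidence and parse its verdict; the returned shape is illustrative
        return {"metric": "Hallucination", "status": "pass", "evidence": evidence}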
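For the new SQL data source, usage might look roughly like the following; the source name, connection keys, and streaming behavior shown here are illustrative assumptions, not the confirmed API.

# Hypothetical SQL-source settings; key names are assumptions.
sql_source = {
    "source": "sql",
    "connection": "postgresql://user:password@localhost:5432/corpus",
    "query": "SELECT id, question, answer FROM qa_pairs",  # rows stream into the pipeline
}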
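Based on the fields listed above, the standardized result model is presumably something along these lines; the exact field names and types are assumptions, not the definition shipped in the PR.

from typing import List, Optional

from pydantic import BaseModel


class EvalDetail(BaseModel):
    """Sketch of the standardized evaluator result; field names are assumed."""

    metric: str                    # name of the rule or LLM evaluator
    status: str                    # "pass" or "fail"
    score: Optional[float] = None  # numeric score, when the evaluator emits one
    label: List[str] = []          # quality labels, e.g. "QUALITY_BAD_COMPLETENESS"
    reason: List[str] = []         # human-readable explanations for the verdict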
Ignored Files
  • Ignored by pattern: .github/workflows/** (3)
    • .github/workflows/IntegrationTest.yml
    • .github/workflows/lint.yml
    • .github/workflows/metrics-validation.yml

@gemini-code-assist (bot) left a comment

Code Review
Code Review

This pull request introduces a significant and impressive update to version 2.0.0, fundamentally refactoring the configuration and evaluation pipeline. The move from a rigid eval_group system to a flexible evaluator pipeline is a major architectural improvement, allowing for more complex and multi-faceted evaluations. The introduction of the agent framework with tool support, SQL data sources, and expanded file format support (Excel, CSV, Parquet) greatly enhances the capabilities of Dingo. The codebase refactoring, especially moving prompts into their respective LLM classes and simplifying the model registry, improves maintainability and clarity. The updates to the Gradio app and the Electron-based GUI also provide a much better user experience. Overall, this is a very strong update that makes Dingo a much more powerful and flexible tool.

#!/usr/bin/env python3
"""检查所有Python文件是否可以成功编译和导入"""

import os
Severity: medium

The os module is imported but not used in this script. It can be safely removed to keep the imports clean.

Comment on lines +92 to 94:

return Object.keys(data.type_ratio?.content || {}).some(key =>
  key.startsWith(firstLevelType + '-')
);

Severity: medium

The logic for hasSecondLevel and the corresponding getSecondLevelData function seems to be broken after the refactoring of the summary.json structure. The type_ratio now has a nested structure, and the keys are in the format QUALITY_BAD_COMPLETENESS.RuleColonEnd, using . as a separator instead of -. The current implementation still checks for key.startsWith(firstLevelType + '-'), which will likely never be true, breaking the drill-down functionality in the pie chart legend. This should be updated to correctly handle the new data structure and restore the drill-down feature, or the related code for drill-down should be removed if the feature is no longer intended to be supported.

@shijinpjlab merged commit 176e192 into main on Dec 25, 2025
5 checks passed