Skip to content

Conversation

@marcellodebernardi
Copy link
Contributor

This PR is a depressingly confusing mish-mash of several bugfixes, improvements, and clean up. The main changes are:

  1. The DatasetGenerator has been simplified, cleaned up, made properly async, and can now handle column-wise data augmentation (earlier only row-wise).
  2. The traces of the smolagents agents are now logged to mlflow for easy tracking
  3. Improvements to mlflow tracking, chain of thought is more informative, etc
  4. Include the EDA report in the model bundle and mlflow for convenience
  5. Miscelanneous bugfixes

@marcellodebernardi marcellodebernardi added bug Something isn't working documentation Improvements or additions to documentation enhancement New feature or request labels May 19, 2025
@plexe-ai plexe-ai deleted a comment from jazzberry-ai bot May 19, 2025
@marcellodebernardi marcellodebernardi merged commit 574b032 into main May 19, 2025
5 checks passed
@marcellodebernardi marcellodebernardi deleted the fix/dataset-generator-cleanup branch May 19, 2025 09:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working documentation Improvements or additions to documentation enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants