@niekdejonge
Collaborator

Previously, the data split was done separately for pos and neg spectra. This PR fixes that.

In addition, I made some changes to the training pipeline.
I removed the StoreTrainingData class. It previously handled storing and loading the data split, but I now find it pretty confusing and unnecessarily complex. Instead, there is now a single split_data function. We no longer store pos and neg separately, but just handle the split during loading.
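
To illustrate the idea, here is a minimal sketch of a single split function with the pos/neg selection done at load time. It is not the code in this PR: the dict-based spectra, the `compound_id` and `ionmode` keys, the `val_fraction` and `seed` parameters, and the `select_ionmode` helper are all assumptions made for the example, and grouping by compound is just one plausible way to split both modes together.

```python
import random
from collections import defaultdict


def split_data(spectra, val_fraction=0.1, seed=42):
    """Split spectra into (train, val) once, for both ionization modes together.

    Hypothetical sketch: each spectrum is a dict with the assumed keys
    'compound_id' and 'ionmode'.
    """
    # Group spectra by compound so one compound never lands in both sets.
    by_compound = defaultdict(list)
    for spectrum in spectra:
        by_compound[spectrum["compound_id"]].append(spectrum)

    # Shuffle the compound ids reproducibly and pick the validation ids.
    compound_ids = sorted(by_compound)
    random.Random(seed).shuffle(compound_ids)
    n_val = max(1, round(len(compound_ids) * val_fraction))
    val_ids = set(compound_ids[:n_val])

    train, val = [], []
    for compound_id, group in by_compound.items():
        (val if compound_id in val_ids else train).extend(group)
    return train, val


def select_ionmode(spectra, ionmode):
    """Filter an already split set by ionization mode at load time."""
    return [s for s in spectra if s["ionmode"] == ionmode]
```

With this layout only one split has to be stored, and something like `select_ionmode(train, "positive")` gives the per-mode subset when it is needed.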

The file names are now stored/generated in SettingsMS2DeepScore. This means the spectrum file name is now passed to SettingsMS2DeepScore as well. I needed a place to store these file names, and this seemed the most logical one. The added benefit is that the user can, but doesn't have to, specify any of the results folders. The settings also store the input and output file locations, which is nice if you want to trace back the origin of older models.
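
As a rough illustration of that pattern, the sketch below derives the output locations from the spectrum file name unless the user overrides the results folder. `ExampleSettings` is a hypothetical stand-in and not the real SettingsMS2DeepScore; the attribute names and the default file names (`results`, `model.pt`, `settings.json`) are assumptions for the example only.

```python
import os


class ExampleSettings:
    """Hypothetical stand-in, not the real SettingsMS2DeepScore."""

    def __init__(self, spectrum_file_path, results_folder=None):
        # Keep the input location so older models can be traced back to it.
        self.spectrum_file_path = spectrum_file_path

        # Optional override; by default the results go next to the input file.
        base_dir = os.path.dirname(spectrum_file_path)
        self.results_folder = results_folder or os.path.join(base_dir, "results")

        # Output file names are generated from the results folder.
        self.model_file_name = os.path.join(self.results_folder, "model.pt")
        self.settings_file_name = os.path.join(self.results_folder, "settings.json")

    def to_dict(self):
        """Return all stored locations, e.g. for saving next to the model."""
        return dict(vars(self))
```

Saving the output of `to_dict()` together with a trained model would keep a record of which spectrum file it was trained on.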
