Skip to content

jankovicgd/csnFramework

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

105 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

csnFramework

Simple framework for searching for the best k for kNN and evaluating different optimisation methods in python for the course in Machine Learning at Technical University Vienna.

How to use

Run app.py with python.

Datasets

To use put clean datasets in the datasets folder with the target value named 'target'.

k search

Global Brute force search

Simply brute force searches from 1 - square root of n plus 10% above. Warning: Could experience long run time

Local Brute force search

Based on the premise that the best k for kNN is in the area of sqrt(n) where n is number of instances for the dataset. The custom search allows for searching within 10% of n in the interval around sqrt(n) and brute force searching in that region.

Randomized search

Binary search

Optimisation strategies

The second part of the program checks for different sklearn optimisation strategies and measures fit_time, score_time and runtime of the cross validation. The strategies are kd-tree, ball-tree and brute force.

Contributors

Nikola Jankovic

Slimane Makhlouf

Christian Ryan

About

Simple framework for evaluating optimisation strategies for knn

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages