Mateo Dulce Rubio*, Siqi Zeng*, Qi Wang, Didier Alvarado, Francisco Moreno, Hoda Heidari, and Fei Fang
This repository contains all the instructions to replicate the results of the paper "RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization"
Required packages:
pandas
numpy
sklearn
pytorch
lightgbm
pytorch_tabnet
The dataset used for experiments in paper is in /processed_dataset folder.
We provide three validation methods in our paper in /train_val_stream folder.
blockCV: blockCV methodbolivar: blockV methodtransfer: transferCV method
We provide an example in bash run_reland.sh. Some configurations include:
--timestampan unique experiment string--municipiouse which validation method, eitherblockCV,bolivaror ``--subsetuse which subset of features, single (distance to historical landine), geo (geospatial features) or full (all 70 features)--modelwhich model to use,TabCmptis the RELand model, other options can beMLP,TabNet,LR,RF,SVM,LGBM--objectiveusing irm, erm, or pnorm--n_stepnumber of decision blocks--warm_startdirectory with checkpoints
Your final result for this run will be stored under /experiments/<timestamp> that contains
- all current
.pyfiles in the root directory - a
<municipality>.pthmodel for each municipality - an
<municipality>.pngimage for each municipality that visualizes the ground truth and prediction config.jsonwith current configuration (hyper)parametersmetrics.jsonthat contains all 4 metricspredicted_proba.csvthat combines validation prediction for all municipalitiesfeature_importance.csvwith global feature importance if applicable. Black-box models generate -1 values for all features.
We borrow and edit packages including ood-bench, scikit-learn/tree, pytorch-tabnet.