diff --git a/README.md b/README.md
index 8d25688..af00c44 100644
--- a/README.md
+++ b/README.md
@@ -62,13 +62,16 @@ Project is divided into two modules:
 
 ### bigfiles
 
+- [How to Run](bigfiles/README.md#how-to-run)
 - bigfile is file that does not fit to RAM
 - module for comparing big files
 - written in Scala
 - more about bigfiles module could be found in [bigfiles README](bigfiles/README.md)
 
+
 ### smallfiles
 
+- [How to Run](smallfiles/README.md#how-to-run)
 - smallfile is file that fits to RAM
 - module for comparing small files
 - written in Python
diff --git a/bigfiles/README.md b/bigfiles/README.md
index fe24e5e..e3bdd5a 100644
--- a/bigfiles/README.md
+++ b/bigfiles/README.md
@@ -1,6 +1,6 @@
 # Scala CPS-Dataset-Comparison
 
-This is scala implementation of the project. It is used for comparing big files.
+This is scala implementation of the project. It is used for comparing big files (files that can not fit to RAM).
 
 - [How to run](#how-to-run)
 - [Requirements](#requirements)
@@ -15,6 +15,7 @@ Then run:
 
 ```bash
 spark-submit target/scala-2.12/dataset-comparison-assembly-1.0.jar -o --inputA --inputB
+
 ```
 ### Parameters:
 | Parameter | Description | Required |
@@ -26,6 +27,18 @@ spark-submit target/scala-2.12/dataset-comparison-assembly-1.0.jar -o
This module is not yet implemented.
+
This is python implementation of the project. It is used for comparing small files (files fitting into RAM).
- [Create and run environment](#create-and-run-environment)
- [Run main](#run-main)
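
The READMEs split the two modules by whether the input files fit into RAM. A minimal sketch of that decision rule follows. It is not part of the project: the function name `pick_module`, the `ram_budget_bytes` parameter, and the `overhead_factor` default are all hypothetical, and the factor only reflects the assumption that parsed data usually takes more memory than the file does on disk.

```python
import os


def pick_module(path_a, path_b, ram_budget_bytes, overhead_factor=3.0):
    """Suggest "smallfiles" if both inputs are expected to fit in RAM, else "bigfiles".

    Hypothetical helper: ram_budget_bytes is the memory the caller is willing
    to spend, and overhead_factor is an assumed multiplier for how much larger
    the data becomes once loaded and parsed.
    """
    # Both files must be held in memory at once to compare them.
    total_on_disk = os.path.getsize(path_a) + os.path.getsize(path_b)
    estimated_in_memory = total_on_disk * overhead_factor
    if estimated_in_memory <= ram_budget_bytes:
        return "smallfiles"  # Python module
    return "bigfiles"  # Scala/Spark module
```

Passing the budget in explicitly keeps the sketch portable, since querying free memory is platform-specific; a caller would supply whatever limit suits the machine.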