Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 579 Bytes

File metadata and controls

12 lines (8 loc) · 579 Bytes

LargeFileProcessor

This is a large file processor which deals with files which are bigger than the jvm memory. This sorts the text contents of the original file and write into a different file using external sorting techniques.

Data

Input data has to be placed under src\main\resources\data\ folder and output data will be get generated under src\main\resources\output\ folder.

Test data are from https://www.kaggle.com/freecodecamp/all-posts-public-main-chatroom

Test 1 : freecodecamp_casual_chatroom_anon.csv (1 GB)

Test 2 : freecodecamp_casual_chatroom.csv (2.6 GB)