Creates an HTML page that lists the organisms from a Kraken metagenomics Excel output file, each formatted as an NCBI search link to the organism's NCBI dashboard, then checks a hot-list of organisms against the main file.
This Python script takes the Excel file that results from a Kraken metagenomics analysis.
Before running, prepare the Kraken file:
- Insert "Thiomargarita magnifica" into the Kraken file to serve as a control for the cross-reference to the hot-list.
- Change the name of the organism column to "organism".
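The two preparation steps above are described as manual edits to the Excel file. As a rough illustration only, the same preparation could be done on an in-memory table with pandas. The function name `prepare_kraken_table`, the original column name "Name", and the read count of 0 for the control row are all assumptions made for this sketch; they are not taken from the script.

```python
import pandas as pd

CONTROL_ORGANISM = "Thiomargarita magnifica"


def prepare_kraken_table(df, name_column):
    """Rename the organism column and append the control row if it is missing.

    `name_column` is whatever the Kraken export calls its organism column
    (hypothetical here; check the actual file).
    """
    df = df.rename(columns={name_column: "organism"})
    if CONTROL_ORGANISM not in set(df["organism"]):
        # Assumed read count of 0 so the control sorts last by "Reads"
        control = pd.DataFrame([{"organism": CONTROL_ORGANISM, "Reads": 0}])
        df = pd.concat([df, control], ignore_index=True)
    return df


# Hypothetical stand-in for the sheet loaded from the Kraken Excel file
kraken = pd.DataFrame(
    {"Name": ["Escherichia coli", "Bacillus subtilis"], "Reads": [1200, 340]}
)
prepared = prepare_kraken_table(kraken, "Name")
```

With a real file, the table would come from `pd.read_excel(...)` instead of the inline example data.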
Instructions:
- Run the script.
- Use the file browser to choose the Excel file of organisms (the Kraken output).
- Use the file browser to choose the Excel file of hot-list organisms to cross-check against the Kraken file.
- Click on the "Link" button to run the script.
- The script imports the Excel files, cross-checks them for common organisms, then creates an HTML page with links to NCBI.
- Runs a control to make sure the cross reference is working: "Thiomargarita magnifica"
- The script waits 2 seconds and then opens the HTML doc (titled: zzz.html) in the default browser.
- Sorts the organism list by the "Reads" column.
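The cross-check, control, sort, and link steps above can be sketched as follows, using only the standard library. This is a minimal sketch under stated assumptions, not the script's actual code: the function name `build_report`, the NCBI search URL pattern, the case-insensitive name matching, and the descending sort order are all guesses made for illustration.

```python
import html
from urllib.parse import quote_plus

# Assumed URL pattern for an NCBI search; the script may build its links differently
NCBI_SEARCH = "https://www.ncbi.nlm.nih.gov/search/all/?term="
CONTROL = "Thiomargarita magnifica"


def build_report(kraken_rows, hotlist):
    """Return (html_page, hits) for a list of {"organism", "Reads"} rows."""
    # Normalise hot-list names so matching ignores case and stray spaces
    hot = {name.strip().lower() for name in hotlist}
    # Assumption: highest read counts first
    rows = sorted(kraken_rows, key=lambda r: r["Reads"], reverse=True)
    hits = [r["organism"] for r in rows if r["organism"].strip().lower() in hot]
    items = []
    for r in rows:
        name = r["organism"]
        url = NCBI_SEARCH + quote_plus(name)
        flag = " (hot-list)" if name in hits else ""
        items.append(
            f'<li><a href="{url}">{html.escape(name)}</a>: '
            f'{r["Reads"]} reads{flag}</li>'
        )
    page = "<html><body><ul>\n" + "\n".join(items) + "\n</ul></body></html>"
    return page, hits


# Hypothetical example data standing in for the two Excel files
kraken_rows = [
    {"organism": "Escherichia coli", "Reads": 1200},
    {"organism": "Thiomargarita magnifica", "Reads": 1},
    {"organism": "Bacillus subtilis", "Reads": 340},
]
hotlist = ["Thiomargarita magnifica", "Bacillus anthracis"]

page, hits = build_report(kraken_rows, hotlist)
# The control must appear among the hits, otherwise the cross-check is broken
control_ok = CONTROL in hits
```

The sketch stops at returning the HTML string. Writing it to zzz.html and opening it with `webbrowser.open` after a short `time.sleep`, as the steps describe, is left out so the example has no side effects.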
Required imports:
- pandas
- PySimpleGUI
- time
- csv, os
- webbrowser
Required ancillary files, in the same folder as the script:
- Image files for LED
- Hot-list of organisms to cross-check against the Kraken file.