Skip to content

Commit 4950e97

Browse files
Update README.MD
1 parent fa64404 commit 4950e97

1 file changed

Lines changed: 5 additions & 18 deletions

File tree

README.MD

Lines changed: 5 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -24,28 +24,15 @@ We manage a few repos externally:
2424
- [PubMed Downloader](https://github.com/shloknatarajan/PubMed-Downloader): This repo is used to download all the markdown files from the PMIDs represented in `var_drug_ann.tsv`
2525
- [Huggingface/AutoGKB](https://huggingface.co/datasets/shlokn/autogkb): This converts the annotations and article text into a dataset format for benchmarking
2626

27-
## Progress Tracker
28-
| Category | Task | Status |
29-
| --- | --- | --- |
30-
| Initial Download | Download the zip of variants from pharmgkb ||
31-
| | Get a PMID list from the variants tsv (column PMID) ||
32-
| | Convert the PMID to PMCID ||
33-
| | Update to use non-official pmid to pmcid (aaron's method) | |
34-
| | Fetch the content from the PMCID ||
35-
| Benchmark | Create pairings of annotations to articles ||
36-
| | Create a niave score of number of matches | |
37-
| | Create group wise score | |
38-
| | Look into advanced scoring based on distance from truth per term | |
39-
| Workflows | Integrate Aaron's current approach ||
40-
| | Document on individual annotation meanings | |
41-
| | Delegate annotation groupings to team members | |
42-
| New Article Fetching | Replicate PharGKB current workflow | |
43-
4427
## System Overview
4528
![Annotations Diagram](assets/annotations_diagram.svg)
4629

4730
## Downloading the data
4831
```
4932
pixi run gdown —-id 1qtQWvi0x_k5_JofgrfsgkWzlIdb6isr9
5033
unzip autogkb-data.zip
51-
```
34+
35+
OR
36+
37+
pixi run setup-repo
38+
```

0 commit comments

Comments
 (0)