Skip to content

add all PIE dataset builder scripts from PyTorch-IE#22

Open
ArneBinder wants to merge 3 commits into
mainfrom
add_pytorch_ie_dataset_builders
Open

add all PIE dataset builder scripts from PyTorch-IE#22
ArneBinder wants to merge 3 commits into
mainfrom
add_pytorch_ie_dataset_builders

Conversation

@ArneBinder
Copy link
Copy Markdown
Owner

@ArneBinder ArneBinder commented Nov 7, 2023

Requires: #21

TODO:

@ArneBinder ArneBinder force-pushed the add_pytorch_ie_dataset_builders branch 2 times, most recently from e706aed to 6c0e68d Compare November 7, 2023 19:32
ArneBinder added a commit to ArneBinder/pytorch-ie that referenced this pull request Nov 7, 2023
ArneBinder added a commit to ArneBinder/pytorch-ie that referenced this pull request Nov 8, 2023
* create document fixture without any dataset class

* remove any dataset related classes; remove statistic implementations

* remove PIE dataset loading scripts (they will be added to pie-datasets, see ArneBinder/pie-datasets#22)

* remove dataset specific nox test sessions

* cleanup

* add pandas dependency because it is required in metrics (f1 and statistics)

* fix nox mypy session

* export RequiresDocumentTypeMixin and DocumentStatistic in core

* remove workflow to upload datasets to the Huggingface hub (in favor of ArneBinder/pie-datasets#27)
@ArneBinder ArneBinder force-pushed the add_pytorch_ie_dataset_builders branch from 6c0e68d to d3c565d Compare November 8, 2023 12:59
@codecov
Copy link
Copy Markdown

codecov Bot commented Nov 8, 2023

Codecov Report

❌ Patch coverage is 66.21005% with 74 lines in your changes missing coverage. Please review.
✅ Project coverage is 88.45%. Comparing base (ffe3ad5) to head (d3c565d).
⚠️ Report is 582 commits behind head on main.

Files with missing lines Patch % Lines
dataset_builders/pie/germeval_14/germeval_14.py 59.25% 11 Missing ⚠️
...ity_recognition/german_legal_entity_recognition.py 73.68% 10 Missing ⚠️
dataset_builders/pie/conll2002/conll2002.py 64.00% 9 Missing ⚠️
dataset_builders/pie/conllpp/conllpp.py 64.00% 9 Missing ⚠️
dataset_builders/pie/germaner/germaner.py 64.00% 9 Missing ⚠️
dataset_builders/pie/ncbi_disease/ncbi_disease.py 64.00% 9 Missing ⚠️
dataset_builders/pie/wnut_17/wnut_17.py 64.00% 9 Missing ⚠️
dataset_builders/pie/wikiann/wikiann.py 72.41% 8 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #22      +/-   ##
==========================================
- Coverage   92.51%   88.45%   -4.06%     
==========================================
  Files          15       23       +8     
  Lines        1202     1421     +219     
==========================================
+ Hits         1112     1257     +145     
- Misses         90      164      +74     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant