Skip to content

Add uploading of datasets. #52

@ManuelAlvarezC

Description

@ManuelAlvarezC

Description

Trello card :https://trello.com/c/tb08vrGi

We need to upload the datasets generated by our data sources, to make them easily accesible to other teams.

To do we need to:

  1. Create a set of default parameters for each datasource, which may be empty.
  2. Create a function that given a data_source, runs it with the defined parameters, store the result as a csv and packs it in a folder with a copy of the audit and metapackage.json files.
  3. Make a function, using this notebook as a template that takes the path to a data package and uploads to kaggle.
  4. Create a function that take no arguments, and iterate through the data sources, generating the datapackages and uploading them to kaggle.
  5. Create a github action that runs the function of step 4 each 24 hours.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions