Skip to content

write sync metadata alongside files? #42

@jeremybmerrill

Description

@jeremybmerrill

Would y'all accept a PR that would have datakit write a zero-size placeholder with metadata matching the date of the most recent write of the relevant file to S3?

e.g. writing data/clean/my_big_file.csv.datakit_placeholder when data/clean/my_big_file.csv is uploaded to S3.

The motivation is that I do not remember when I most recently synced. When it comes time to delete big old files, I want greater confidence that the most recent version is synced to S3. And, if I come back to a project, re-run a notebook, and find that a big data file no longer exists on disk, I'd love to have a reminder that it's on datakit (versus my external HDD or gone forever or on a colleague's laptop or whatever).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions