Data Scientist at Event Registry
Artificial Intelligence • Natural Language Processing • Cross-lingual Language Models • Semi-Automatic Text Processing • Data Visualization
This repository contains personal development environment configurations and dotfiles. To install, run:
./setup.sh- Shell: Bash with Starship prompt featuring Catppuccin Mocha theme
- Terminal Multiplexer: Tmux with Catppuccin theme, vim-style key bindings, and TPM plugin management
- Text Editors: Nano and Vim configured with syntax highlighting and custom themes
- Version Control: Git with enhanced colorized output and productivity aliases (
lg,st,up) - Python Environment: UV package manager with convenient aliases
- Node.js Runtime: nvm (Node Version Manager) for version management
| Package | Description | GitHub Stars | PyPI |
|---|---|---|---|
| anonipy | Data anonymization library supporting multiple anonymization strategies and techniques | ||
| datachart | Flexible data visualization library with simple API and extensive customization options |
| Dataset | Description | GitHub Stars | Repository |
|---|---|---|---|
| OG2021 | Comprehensive dataset from the 2021 Tokyo Olympics | Clarin.si | |
| SloATOMIC 2020 | Slovene translation of the ATOMIC 2020 commonsense reasoning dataset | Clarin.si |
eriknovak/cookiecutter-ml-dvc — Template for machine learning experiments using DVC for version control and reproducibility (in development).
# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-dvceriknovak/cookiecutter-ml-hpc — Template for machine learning experiments on HPC clusters with SLURM workload manager (in development).
# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-hpc



