Skip to content
View eriknovak's full-sized avatar

Highlights

  • Pro

Organizations

@qminer

Block or report eriknovak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ErikNovak/README.md

Erik Novak

Data Scientist at Event Registry

Research Focus

Artificial Intelligence • Natural Language Processing • Cross-lingual Language Models • Semi-Automatic Text Processing • Data Visualization

Connect: LinkedIn | Homepage

Development Environment

This repository contains personal development environment configurations and dotfiles. To install, run:

./setup.sh

Configuration Stack

  • Shell: Bash with Starship prompt featuring Catppuccin Mocha theme
  • Terminal Multiplexer: Tmux with Catppuccin theme, vim-style key bindings, and TPM plugin management
  • Text Editors: Nano and Vim configured with syntax highlighting and custom themes
  • Version Control: Git with enhanced colorized output and productivity aliases (lg, st, up)
  • Python Environment: UV package manager with convenient aliases
  • Node.js Runtime: nvm (Node Version Manager) for version management

Open Source Contributions

Python Packages

Package Description GitHub Stars PyPI
anonipy Data anonymization library supporting multiple anonymization strategies and techniques Stars PyPi
datachart Flexible data visualization library with simple API and extensive customization options Stars PyPi

Research Datasets

Dataset Description GitHub Stars Repository
OG2021 Comprehensive dataset from the 2021 Tokyo Olympics Stars Clarin.si
SloATOMIC 2020 Slovene translation of the ATOMIC 2020 commonsense reasoning dataset Stars Clarin.si

Project Templates

Machine Learning with DVC

eriknovak/cookiecutter-ml-dvc — Template for machine learning experiments using DVC for version control and reproducibility (in development).

# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-dvc

Machine Learning on HPC Systems

eriknovak/cookiecutter-ml-hpc — Template for machine learning experiments on HPC clusters with SLURM workload manager (in development).

# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-hpc

Pinned Loading

  1. anonipy anonipy Public

    Data anonymization package, supporting different anonymization strategies

    Python 7 3

  2. datachart datachart Public

    Data visualization package, simple to use, highly customizable

    Python 3 1

  3. cookiecutter-ml-dvc cookiecutter-ml-dvc Public

    Template for cookiecutter project for machine learning experiments with DVC

    Python

  4. cookiecutter-ml-hpc cookiecutter-ml-hpc Public

    The cookiecutter template for creating projects for running machine learning experiments on HPCs (specifically SLURM)

    Shell