Issue fixes #49

Merged: radoslawrolka merged 10 commits into master from issue_fixes on Jan 11, 2026

@WerWojtas commented Jan 10, 2026

Collaborator

📝 Description

  • refactored elixir datasets into smaller packages
  • changed readme
  • added more description to examples
  • updated tests

🎯 Type of Changes

Remove the lines that do not apply:

  • 📚 Documentation update
  • 🔄 Refactoring

@WerWojtas marked this pull request as ready for review January 10, 2026 15:24
WeronikaW-REM and others added 2 commits January 10, 2026 16:24
Co-authored-by: Weronika <weronikaw@remitly.com>

Copilot AI left a comment

Contributor

Pull request overview

This pull request refactors the ElixirDatasets library into smaller, more modular packages, updates documentation, and reorganizes tests. The main module now delegates to specialized modules for different concerns (loading, repository management, info fetching, filtering, streaming).

Changes:

  • Refactored monolithic lib/elixir_datasets.ex into separate modules: Loader, Repository, Info, Filter, and Streaming
  • Reorganized tests from single file into focused test modules matching the new module structure
  • Updated README with simplified quick start and examples
  • Replaced examples/example_1.livemd with two comprehensive example files: usage_examples.livemd and integration_examples.livemd
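The delegation pattern described in this overview can be sketched roughly as follows. This is an illustration only: the `Sketch`, `Sketch.Loader`, and `Sketch.Filter` modules and their function bodies are hypothetical stand-ins, not the code from this pull request.

```elixir
# Hypothetical names throughout; the PR's real modules and functions may differ.
defmodule Sketch.Loader do
  # Stand-in for the dataset loading logic.
  def load(name, opts) do
    {:ok, %{name: name, split: Keyword.get(opts, :split, "train")}}
  end
end

defmodule Sketch.Filter do
  # Stand-in for the filtering logic.
  def filter(rows, fun) when is_function(fun, 1), do: Enum.filter(rows, fun)
end

defmodule Sketch do
  # The public facade keeps its API and forwards each call
  # to the module that owns the concern.
  defdelegate load(name, opts), to: Sketch.Loader
  defdelegate filter(rows, fun), to: Sketch.Filter
end
```

With this layout, callers keep using the top-level module (here `Sketch.load("demo", [])`), while each concern can be tested in its own file.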

Reviewed changes

Copilot reviewed 19 out of 19 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| lib/elixir_datasets.ex | Refactored to delegate functions to specialized modules |
| lib/elixir_datasets/dataset_loader.ex | New module handling dataset loading logic |
| lib/elixir_datasets/repository.ex | New module for repository management |
| lib/elixir_datasets/info_getter.ex | New module for fetching dataset metadata |
| lib/elixir_datasets/streaming.ex | New module for streaming functionality |
| lib/elixir_datasets/filter.ex | New module for filtering datasets |
| lib/elixir_datasets/huggingface/hub.ex | Added Path.expand() to cache_dir |
| lib/elixir_datasets/utils/file_loader.ex | Fixed iex examples formatting |
| test/elixir_datasets_test.exs | Simplified to public API integration tests |
| test/elixir_datasets/streaming_test.exs | New tests for streaming functionality |
| test/elixir_datasets/repository_test.exs | New tests for repository module |
| test/elixir_datasets/info_getter_test.exs | New tests for info getter module |
| test/elixir_datasets/filter_test.exs | New tests for filter module |
| test/elixir_datasets/dataset_loader_test.exs | New tests for loader module |
| test/elixir_datasets/huggingface/hub_test.exs | Removed debug IO.puts statements |
| README.md | Simplified documentation with focus on quick start |
| examples/usage_examples.livemd | New comprehensive usage examples |
| examples/integration_examples.livemd | New integration examples with Nx/Axon/Bumblebee |
| examples/example_1.livemd | Removed (replaced by new example files) |

Comment thread examples/usage_examples.livemd
Comment thread examples/usage_examples.livemd
Comment thread lib/elixir_datasets/repository.ex Outdated
Comment thread lib/elixir_datasets/dataset_loader.ex
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@radoslawrolka merged commit aa86532 into master Jan 11, 2026
1 check passed
@radoslawrolka deleted the issue_fixes branch January 11, 2026 16:18

4 participants