Skip to content
@U4RASD

Unit for Research In Arabic Social and Digital Spaces

Part of the Arab Center for Research and Policy Studies (ACRPS)

Popular repositories Loading

  1. dalla-model-training dalla-model-training Public

    Dalla training recipe using Huggingface SFT trainer

    Python 8 1

  2. r-bpe r-bpe Public

    R-BPE: Improving BPE-Tokenizers with Token Reuse

    Python 5 2

  3. ar-ms-baseline ar-ms-baseline Public

    Training code to establish baseline for NAKBA NLP 2026: Arabic Manuscript Understanding Shared Task

    Python 3

  4. dalla-data-processing dalla-data-processing Public

    data processing pipeline used for the DALLA Models.

    C 1

  5. dalla-sentencepiece dalla-sentencepiece Public

    A tool to update an existing tokenizer model with new and common tokens from a newly trained tokenizer model. It works with any SentencePiece model.

    Python 1

  6. .github .github Public

Repositories

Showing 7 of 7 repositories
  • stance-nakba-2026 Public

    Arabic stance detection for the StanceNakba 2026 shared task.

    U4RASD/stance-nakba-2026’s past year of commit activity
    Python 0 0 0 0 Updated Mar 29, 2026
  • dalla-data-processing Public

    data processing pipeline used for the DALLA Models.

    U4RASD/dalla-data-processing’s past year of commit activity
    C 1 0 1 0 Updated Jan 29, 2026
  • ar-ms-baseline Public

    Training code to establish baseline for NAKBA NLP 2026: Arabic Manuscript Understanding Shared Task

    U4RASD/ar-ms-baseline’s past year of commit activity
    Python 3 0 0 0 Updated Jan 14, 2026
  • dalla-model-training Public

    Dalla training recipe using Huggingface SFT trainer

    U4RASD/dalla-model-training’s past year of commit activity
    Python 8 1 0 0 Updated Dec 16, 2025
  • dalla-sentencepiece Public

    A tool to update an existing tokenizer model with new and common tokens from a newly trained tokenizer model. It works with any SentencePiece model.

    U4RASD/dalla-sentencepiece’s past year of commit activity
    Python 1 0 0 0 Updated Dec 15, 2025
  • r-bpe Public

    R-BPE: Improving BPE-Tokenizers with Token Reuse

    U4RASD/r-bpe’s past year of commit activity
    Python 5 2 0 1 Updated Nov 26, 2025
  • .github Public
    U4RASD/.github’s past year of commit activity
    0 0 0 0 Updated Nov 26, 2025