47 commits
9683d6e
fix optimizer CP restore
tianshub Feb 11, 2026
e06c7ed
[Tunix] Add log_level config to SglangJaxSampler.
lc5211 Feb 11, 2026
84b7f82
[Tunix] Add trajectory logging to agentic GRPO learner.
lc5211 Feb 11, 2026
e06c593
remove max_steps from trajectory_collect_engine
tianshub Feb 11, 2026
279b975
use absl logging across the repo
tianshub Feb 11, 2026
47a9f25
feat: log rollout and train time at micro batch level.
jiangyangmu Feb 11, 2026
5e54d57
Code update
s-noghabi Feb 12, 2026
f43c85a
[Tunix] Remove upper bound on JAX version in tunix prod dependencies.
wang2yn84 Feb 12, 2026
88f08c6
Refactor GRPO rollout to simplify grouping and avoid deepcopy
hgao327 Feb 12, 2026
7141c9f
chore: Migrate gsutil usage to gcloud storage
gurusai-voleti Feb 12, 2026
0a98e91
Copybara import of the project:
gagika Feb 12, 2026
f739ee6
[Tunix] Pad the number of heads for projection bias.
wang2yn84 Feb 12, 2026
035373c
Remove max_open_buckets from GroupQueueManager
hgao327 Feb 12, 2026
30e37dc
Merge pull request #1082 from gurusai-voleti:ai-gsutil-migration-4bd8…
a-googler Feb 13, 2026
d19ecbb
add log_level in SglangJaxConfig and update default page_size from 64…
aolemila Feb 13, 2026
bbba936
Merge pull request #1090 from google:tiny/update-sglangjax-arguments
a-googler Feb 13, 2026
ea538fd
fix potential race condition on dictionary update
tianshub Feb 13, 2026
2fa731e
update doc string and error message
s-noghabi Feb 13, 2026
b46a716
Supports padding kwargs for samplers.
wang2yn84 Feb 13, 2026
2567514
Copybara import of the project:
NicoGrande Feb 13, 2026
0a19790
Merge pull request #1095 from google:lance-fix-kwargs
a-googler Feb 13, 2026
2c5f81b
[Tunix]: Skip the already trained data on job resume.
lc5211 Feb 13, 2026
9f2ac84
[Tunix] Refactor DeepScaler training script to support different roll…
wang2yn84 Feb 14, 2026
ec0a820
disable perf metrics by default in the cli.
s-noghabi Feb 14, 2026
7c73390
minor update
tianshub Feb 14, 2026
50b81a6
Added a GPU demo for PEFT with QLoRA on Llama 3_1
katjasrz Feb 17, 2026
ea6951b
Merge pull request #1105 from katjasrz:main
a-googler Feb 17, 2026
c81a5c1
Adds vllm logging capability to vllm async driver.
wang2yn84 Feb 17, 2026
14e9676
Merge pull request #1107 from google:lance-vllm-logging
a-googler Feb 17, 2026
21d9ce8
Use Exception instead BaseException
wang2yn84 Feb 17, 2026
0f80da8
Merge pull request #1108 from google:lance-vllm-logging1
a-googler Feb 17, 2026
5454984
fix loss mask for agentic learner
tianshub Feb 18, 2026
62a5d5e
Refactor.
wang2yn84 Feb 17, 2026
6808337
Set the max worker number in asyncio loop.
wang2yn84 Feb 18, 2026
b4ff700
Merge pull request #1111 from google:lance-fix-concurrency
a-googler Feb 18, 2026
ee8a07c
Allow config_id as an alternative model_id to automodel
s-noghabi Feb 18, 2026
769f42f
add comment clarifying micro batch has to be 1
s-noghabi Feb 18, 2026
7534565
[Tunix] This change adds unique metadata IDs to each cell in the Jupy…
lc5211 Feb 18, 2026
6314324
Merge pull request #1110 from google:lance-refactor-initvar
a-googler Feb 18, 2026
d3f81ac
forbidden_tokens in sampler call accepts token IDs instead of strings.
galenmandrew Feb 18, 2026
707db58
simplify trajectory result processing
tianshub Feb 18, 2026
4376cf0
fix group_id and pair_idx in traj
tianshub Feb 19, 2026
fb9c752
Add Colab and Kaggle badges to qlora_llam3_gpu example tutorial
rajasekharporeddy Feb 19, 2026
acb5eb4
Merge pull request #1122 from rajasekharporeddy:colab_badge
a-googler Feb 19, 2026
b3e3ffc
[Tunix] mprove GCS CSV writing.
lc5211 Feb 19, 2026
022a17c
Creates a test notebook to use an agent-sandbox instead of a pod
igooch Feb 6, 2026
a3703b7
Adds test that starts and runs in multiple sandboxes
igooch Feb 19, 2026
2 changes: 2 additions & 0 deletions .gitignore
@@ -10,8 +10,10 @@
.idea
.vscode
.envrc
tmp/

# virtualenv/venv directories
**/.venv/
/venv/
/bin/
/include/
1 change: 1 addition & 0 deletions .python-version
@@ -0,0 +1 @@
3.12.9
40 changes: 22 additions & 18 deletions docs/models.md
@@ -62,15 +62,15 @@ Adding a new model needs to follow the naming convention that Tunix supports
## AutoModel

`AutoModel` provides a unified interface for instantiating Tunix models from
pretrained checkpoints, similar to the Hugging Face `AutoModel` API. It allows
pretrained checkpoints, similar to the Huggingface `AutoModel` API. It allows
you to load a model simply by providing its `model_id`, handling the download
and initialization for you.

### Basic Usage

To load a model, use the `AutoModel.from_pretrained` method with the model
identifier and your JAX sharding mesh. By default this will download the model
from HuggingFace.
from Huggingface.

```python
from tunix.models.automodel import AutoModel
@@ -80,9 +80,9 @@ import jax
mesh = jax.make_mesh((1, 1), ("fsdp", "tp"), axis_types=(jax.sharding.AxisType.Auto,) * 2)

# 2. Load the model
# By default, this downloads from Hugging Face.
# By default, this downloads from Huggingface.
model, model_path = AutoModel.from_pretrained(
model_id="google/gemma-2-2b-it",
model_id="google/gemma-2-2b-it", # Using HF id as model_id
mesh=mesh
)

@@ -94,20 +94,19 @@ print(f"Model loaded from: {model_path}")
You can load models from different sources (e.g., Kaggle, GCS, etc.) using the
`model_source` argument.

#### From HuggingFace:
#### From Huggingface:

This is the default choice (`ModelSource.HUGGINGFACE`) as shown in the
example above.

#### From Kaggle:

For Kaggle, you must provide the `model_id` which is the Hugging Face identifier
(to determine the model configuration) and the `model_path` which is the Kaggle
For Kaggle, you must provide the `model_id`, which is the Huggingface identifier or `model_config_id` (see [Naming Conventions](models.md#naming-conventions)) used to determine the model configuration, and the `model_path`, which is the Kaggle
Hub model identifier (used to download the model from Kaggle).

```python
model, model_path = AutoModel.from_pretrained(
model_id="google/gemma2-2b-it",
model_id="gemma2_2b_it", # Using model_config_id as model_id
mesh=mesh,
model_source=ModelSource.KAGGLE,
model_path="google/gemma-2/flax/gemma2-2b-it",
@@ -120,13 +119,12 @@ For example the `model_path` for the `google/gemma-2/flax/gemma2-2b-it` is extra

#### From GCS:

For GCS, you must provide the `model_id` which is the Hugging Face identifier
(to determine the model configuration) and the `model_path` (the actual GCS
For GCS, you must provide the `model_id`, which is the Huggingface identifier or `model_config_id` (see [Naming Conventions](models.md#naming-conventions)) used to determine the model configuration, and the `model_path` (the actual GCS
location).

```python
model, model_path = AutoModel.from_pretrained(
model_id="google/gemma-2-2b-it",
model_id="gemma2_2b_it", # Using model_config_id as model_id
mesh=mesh,
model_source=ModelSource.GCS,
model_path="gs://my-bucket/gemma-2-2b-it"
@@ -139,7 +137,7 @@ Optionally, you can also provide the `model_download_path` argument, which
specifies where the model is to be downloaded to. Depending on the
`model_source` the effect of specifying this variable is different:

* **Hugging Face**: Files are downloaded directly to this directory.
* **Huggingface**: Files are downloaded directly to this directory.
* **Kaggle**: Sets the `KAGGLEHUB_CACHE` environment variable to this path.
* **GCS**: No-op.
* **Internal**: Files are copied to this directory. If omitted, the model is loaded directly from the `model_path`. This mode (Internal) is not supported in OSS version.
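The per-source behavior above can be summarized in a small illustrative sketch (a hypothetical helper, not the Tunix implementation; it only mirrors the list of effects described here):

```python
import os

def apply_download_path(model_source: str, model_download_path: str) -> str:
    """Sketch of how `model_download_path` is interpreted per source
    (illustrative only, not Tunix code)."""
    if model_source == "huggingface":
        # Files are downloaded directly to this directory.
        return f"download to {model_download_path}"
    if model_source == "kaggle":
        # Sets the KAGGLEHUB_CACHE environment variable to this path.
        os.environ["KAGGLEHUB_CACHE"] = model_download_path
        return "kagglehub cache set"
    if model_source == "gcs":
        # No-op: the model is read from the GCS path directly.
        return "no-op"
    raise ValueError(f"unsupported model_source: {model_source}")
```

For example, `apply_download_path("gcs", "/tmp/models")` returns `"no-op"`, while the Kaggle branch only sets the cache environment variable as a side effect.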
@@ -148,21 +146,27 @@ specifies where the model is to be downloaded to.

This section outlines the naming conventions used within Tunix for model
identification and configuration. These conventions ensure consistency when
loading models from various sources like Hugging Face or Kaggle.
loading models from various sources like Huggingface or Kaggle.

The `ModelNaming` dataclass handles the parsing and standardization of model names.

* **`model_id`**: The full model name identifier (case sensitive), as it appears
on Hugging Face, including the parent directory. For example,
* **`model_id`**: A unique identifier used to identify the model and to extract its family, version, and desired config. Tunix supports two forms of `model_id`:
1. **Huggingface (HF) IDs:** The full model name identifier (case sensitive), as it appears
on Huggingface, including the parent directory.
* **Extracting model_id from HF**: For example,
`meta-llama/Llama-3.1-8B` is extracted as shown below:
![Hugging Face extracting Model ID](images/model_id_huggingface.png){: width="75%"}
![Huggingface extracting Model ID](images/model_id_huggingface.png){: width="75%"}

2. **Native Tunix model_configs:** The `model_config_id`, representing the exact config in the model class, can be used directly as the `model_id`. In this case it is also treated as the `model_name`.
* **Extracting model_id from model_config_id**: In this case, refer to the source code (`model.py`) for each model family and select the config id from the `ModelConfig` class, for example `llama3p1_8b` from the llama [model code](https://github.com/google/tunix/blob/main/models/llama3/model.py).


* **`model_name`**: The unique full name identifier of the model. This
corresponds to the full name and should match exactly with the model name
used in Hugging Face or Kaggle. It is typically all lowercase and formatted
as `<model-family>-<model-version>`.
* *Example*: `gemma-2b`, `llama-3.1-8b`, `gemma2-2b-it`.
as `<model-family>-<model-version>` (when an HF id is used as the `model_id`) or `<model-family>_<model-version>` (when a `model_config_id` is used as the `model_id`).
* *Example for HF as model_id*: `gemma-2b`, `llama-3.1-8b`, `gemma-2-2b-it`.
* *Example for model_config_id as model_id*: `gemma_2b`, `llama3p1_8b`, `gemma2_2b_it`.

* **`model_family`**: The standardized model family. Unnecessary hyphens are
removed, and versions are standardized (e.g., replacing dot with `p`).
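The naming rules above can be made concrete with a small self-contained sketch (a hypothetical helper, not part of the Tunix API; it only mirrors the conventions described in this section):

```python
def naming_from_hf_id(model_id: str) -> dict:
    """Derive naming fields from a Hugging Face-style model_id,
    following the conventions above (illustrative only)."""
    # model_name: lowercase tail of the HF id, e.g.
    # "meta-llama/Llama-3.1-8B" -> "llama-3.1-8b"
    model_name = model_id.split("/")[-1].lower()
    # Standardize versions: dots become "p" ("3.1" -> "3p1").
    parts = [p.replace(".", "p") for p in model_name.split("-")]
    # A purely numeric version token fuses with the family name
    # ("llama" + "3p1" -> "llama3p1"); size/variant tokens like "8b" do not.
    if len(parts) > 1 and parts[1].replace("p", "").isdigit():
        family, rest = parts[0] + parts[1], parts[2:]
    else:
        family, rest = parts[0], parts[1:]
    return {
        "model_name": model_name,
        "model_family": family,
        "model_config_id": "_".join([family] + rest),
    }

print(naming_from_hf_id("meta-llama/Llama-3.1-8B"))
# -> {'model_name': 'llama-3.1-8b', 'model_family': 'llama3p1',
#     'model_config_id': 'llama3p1_8b'}
```

Under these rules, `google/gemma-2-2b-it` yields the model_name `gemma-2-2b-it` and the model_config_id `gemma2_2b_it`, matching the examples above.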
2 changes: 1 addition & 1 deletion docs/quickstart.md
Expand Up @@ -184,7 +184,7 @@ Next, we load the English-French translation dataset. Note you can use your own
datasets too (PyGrain, Hugging Face dataset, TFDS, etc.).

```sh
gsutil cp gs://gemma-data/tokenizers/tokenizer_gemma3.model .
gcloud storage cp gs://gemma-data/tokenizers/tokenizer_gemma3.model .
```

```python
11 changes: 11 additions & 0 deletions examples/README.rst
@@ -68,9 +68,20 @@ Create a v5litepod-8 TPU VM in GCE:

Reference: `TPU Runtime Versions <https://docs.cloud.google.com/tpu/docs/runtimes?hl=en&_gl=1*1tpeg3j*_ga*MTk1NzE5MjMyNy4xNzYwOTEwNjk3*_ga_WH2QY8WWF5*czE3NjIxNTU1OTEkbzE3JGcwJHQxNzYyMTU1NTkxJGo2MCRsMCRoMA..#training-v5p-v5e>`_

.. code-block:: bash

   gcloud compute tpus tpu-vm create v5-8 \
       --zone=us-west1-c \
       --accelerator-type=v5litepod-8 \
       --version=v2-alpha-tpuv5-lite

2. Configure VM
~~~~~~~~~~~~~~~~

.. code-block:: bash

   gcloud compute tpus tpu-vm ssh --zone "us-west1-c" "v5-8"

SSH into the VM using the supplied gcloud command, then run:

.. code-block:: bash