rohitg00 · kukudan · Jun 4, 2026 · Jun 4, 2026 · Jun 4, 2026
diff --git a/phases/00-setup-and-tooling/09-data-management/docs/en.md b/phases/00-setup-and-tooling/09-data-management/docs/en.md
@@ -57,7 +57,7 @@ This downloads the IMDB movie review dataset. After the first download, it loads
 Some datasets are too large to fit on disk. Streaming loads them row by row without downloading the full thing.
 
 ```python
-dataset = load_dataset("wikimedia/wikipedia", "20220301.en", split="train", streaming=True)
+dataset = load_dataset("wikimedia/wikipedia", "20231101.en", split="train", streaming=True)
 
 for i, example in enumerate(dataset):
     print(example["title"])
@@ -140,7 +140,7 @@ Model weights and large datasets should not go into git. Three options:
 
 **Option A: .gitignore (simplest)**
 
-```
+```text
 *.bin
 *.safetensors
 *.pt