Built upon Llama-3.1 70B, NANDA 87B has been trained on a curated Hindi-English dataset with over 65 billion Hindi tokens. A custom Hindi-centric tokenizer boosts efficiency, reducing both training ...