modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Apache License 2.0
2.63k stars 166 forks source link

use pydantic types #422

Closed drcege closed 3 weeks ago

drcege commented 4 weeks ago

Tuple[str] or Tuple[int] was used incorrectly. They indicate that the tuple can only contain a single element, which differs from List[str] or List[int], which are variable-length sequences by default. After the modification:

Refactor all type annotations from jsonargparse to Pydantic.