cyber2a / cyber2a-course

Online materials for the Cyber2A course on AI for Arctic research
https://cyber2a.github.io/cyber2a-course/
Apache License 2.0
0 stars 0 forks source link

Lesson - AI-ready training datasets #8

Open carmengg opened 8 months ago

carmengg commented 8 months ago

Design principles for an AI-ready training dataset

Synopsis

Introduce principles, approaches, tools, and strategies to create a high- quality AI-ready training dataset that is diverse, sizable, representative, and minimizes data bias for thoughtful AI research.

Learning outcome

Trainees will become familiar with tools for training data creation and gain skills to correctly annotate and document training data and share the data with the broader research community.

AI tools

CVAT (Computer Vision Annotation Tool), PDG data annotation platform