课程: AI Data Strategy: Data Procurement and Storage
免费学习该课程!
今天就开通帐号,24,700 门业界名师课程任您挑!
Best practices for sourcing unstructured data
- [Instructor] In the last video, you saw how to source structured data for ML models, but generative AI brings us into a whole new world, one where we need vast amounts of unstructured data, like text, images, and code. Apart from the sheer volume, the challenge with generative AI is about finding diverse high quality data that you need to help your models learn and consequently generate meaningful content. Teams often think they need to collect every bit of data they can find, but having the biggest data set doesn't guarantee success. It's about having the right kind of data diversity. Think about a large language model like GPT. It's not just trained on perfectly written books and articles. It must also understand how people actually communicate, everything from formal documents to casual conversations, technical manuals to creative writing. This diversity is what allows the model to generate appropriate responses in different contexts. Remember the star framework we discussed…
内容
-
-
-
-
(已锁定)
Sourcing structured data for ML-driven AI products6 分钟 50 秒
-
(已锁定)
Best practices for sourcing unstructured data4 分钟 32 秒
-
(已锁定)
Understanding bias in traditional ML systems6 分钟 42 秒
-
Bias in generative AI: Challenges and mitigation strategies6 分钟 19 秒
-
(已锁定)
Framework for bias mitigation in AI4 分钟 2 秒
-
(已锁定)
Building intelligent systems with data protection5 分钟 13 秒
-
(已锁定)
Open data platforms: Democratizing AI development5 分钟 1 秒
-
(已锁定)
Leveraging APIs for AI6 分钟 45 秒
-
(已锁定)
Building sustainable data ecosystems5 分钟 3 秒
-
(已锁定)
-
-
-