Video: Best Practices for Developing Machine Learning Algorithms for Video
Learn more about machine learning and AI at Streaming Media's next event.
Read the complete transcript of this clip:
Josh Gray: As we've gone after a lot of the projects where we applied techniques with large data sets, you know, you tend to see some of the kind of similar impediments. One is that data quality on the front end is often a challenge. It's a common misconception that you can just take a mess of data and throw it at an AI engine and magic will show out the other side. That's just not true.
We find that making sure that you're clear about what's in your datasets that you're going to be using to train, what are the semantics of the different attributes and the properties that you're going to feed into, the model development is really important. Being very crisp in what you want out of this thing. Do you want a classifier with keywords? Do you want rankings? Do you want comparative outputs? Understanding what are the signals of value that you're trying to train for and then how those relate to the attributes of your datasets, then making sure that throughout that stack that you're driving for clarity and definition, and don't just assume that it's mess in one side, magic out the other.
David Clevinger: I'm going to add to that and say that I think any strategy also has to include a data strategy, how you are training that algorithm, how you are helping it to understand what it is you want out of it.
I think my colleague over here is completely right. You need to know exactly what it is you're going to get out of it, but you need to know how you're going to get there as well. I think that also includes metadata flexibility. If you don't have a flexible metadata structure or someone on your team that understands how to build flexible, relational either hierarchical or relative metadata structures, it's going to be difficult for you to manage that on an ongoing basis. I think you have to have a training strategy going in.
Nadine Krefetz: Is there metadata-as-a-service?
Josh Gray: As we've gone after a lot of the projects where we applied techniques with large data sets, you know, you tend to see some of the kind of similar impediments. One is that data quality on the front end is often a challenge. It's a common misconception that you can just take a mess of data and throw it at an AI engine and magic will show out the other side. That's just not true.
We find that making sure that you're clear about what's in your datasets that you're going to be using to train, what are the semantics of the different attributes and the properties that you're going to feed into, the model development is really important. Being very crisp in what you want out of this thing. Do you want a classifier with keywords? Do you want rankings? Do you want comparative outputs? Understanding what are the signals of value that you're trying to train for and then how those relate to the attributes of your datasets, then making sure that throughout that stack that you're driving for clarity and definition, and don't just assume that it's mess in one side, magic out the other.
David Clevinger: I'm going to add to that and say that I think any strategy also has to include a data strategy, how you are training that algorithm, how you are helping it to understand what it is you want out of it.
I think my colleague over here is completely right. You need to know exactly what it is you're going to get out of it, but you need to know how you're going to get there as well. I think that also includes metadata flexibility. If you don't have a flexible metadata structure or someone on your team that understands how to build flexible, relational either hierarchical or relative metadata structures, it's going to be difficult for you to manage that on an ongoing basis. I think you have to have a training strategy going in.
Nadine Krefetz: Is there metadata-as-a-service?
David Clevinger: Well, that's a great question. Companies like IBM might be building a metadata-as-a-service solution that you might be seeing in the near future, but there are other companies that do this already of course.
Then they do it in a variety of different verticals. It's typically done at the vertical level. You can find healthcare metadata companies. You can find financial services metadata companies. It doesn't really exist for M&E in a completely structured way because everyone is a little bit different. Sports versus movies versus what-have-you. But you could certainly use some existing products to build a metadata service.
Well, that's a great question. Companies like IBM might be building a metadata-as-a-service solution that you might be seeing in the near future, but there are other companies that do this already of course.
Then they do it in a variety of different verticals. It's typically done at the vertical level. You can find healthcare metadata companies. You can find financial services metadata companies. It doesn't really exist for M&E in a completely structured way because everyone is a little bit different. Sports versus movies versus what-have-you. But you could certainly use some existing products to build a metadata service.
Related Articles
Anvato, Google Cloud North American Sales Lead Adam Handman explains how integrated machine learning augments Google Cloud's media workflow in this clip from his presentation at Streaming Media West 2018.
18 Mar 2019
Limelight's Jason Hofmann, Citrix' Josh Gray, and REELY's Cullen Gallagher discuss best practices for training AI systems at Streaming Media East 2018.
12 Nov 2018
Google's Matthieu Lorrain cautions of the risks of doing AI for its own sake in this clip from Streaming Media West 2018.
08 Nov 2018
RealEyes Director of Technology Jun Heider discusses the importance of internal self-assessment and which use-case elements to consider when choosing a platform for video AI in this clip from Streaming Media East 2018.
01 Nov 2018
RealEyes Media Director of Technology Jun Heider identifies the key players in the AI platform space in this clip from Streaming Media East 2018.
29 Oct 2018
RealEyes Director of Technology Jun Heider outlines the first steps in choosing an AI platform in this clip from his presentation at Streaming Media East 2018.
25 Oct 2018
Microsoft Principal Product Manager Rafah Hosn makes the case for reinforcement learning as a machine learning paradigm for content personalization in this clip from Streaming Media East 2018.
22 Oct 2018
Microsoft Principal Product Manager Rafah Hosn discusses the benefits and limitations of a content personalization strategy based on supervised machine learning in this clip from Streaming Media East 2018.
18 Oct 2018
Microsoft Principal Product Manager Rafah Hosn explains how Microsoft's machine learning-driven decision services helps brands target viewers and increase engagement in this clip from Streaming Media East 2018.
15 Oct 2018
Comcast Technical Solutions Architect Ribal Najjar discusses how operationalizing commonalities between QoE and QoS metrics to deliver a "super-powerful" dataset in this clip from Streaming Media East 2018.
11 Oct 2018
Comcast Technical Solutions Architect Ribal Najjar defines video QoE both in terms of subjective experience and qualitative measurement in this clip from Streaming Media East 2018.
08 Oct 2018
IRIS.TV CEO & Co-Founder breaks down discusses IRIS.TV's approach to helping traditional media companies capture and leverage audience data and machine learning in this clip from Streaming Media East 2018.
04 Oct 2018
Gannett Senior Director Kara Chiles discusses how USA Today leveraged IRIS.TV and data to localize and personalize their Winter Olympics 2018 coverage in this clip from Streaming Media East 2018.
01 Oct 2018
ZoneTV's Tom Sauer describes how machine learning can be used to overhaul the TV world and deliver more individualized experiences in this clip from Streaming Media East 2018.
27 Sep 2018
REELY CEO Cullen Gallagher makes the business-growth case for content owners developing an AI strategy in this clip from Streaming Media East 2018.
24 Sep 2018
IBM Watson Media's David Clevinger discusses how media entities are currently using video AI in this clip from Streaming Media East 2018.
20 Sep 2018
Citrix Principal Architect Josh Gray explains how video enables higher-acuity metrics analysis in this clip from Streaming Media East 2018.
17 Sep 2018
Limelight VP of Architecture Jason Hofmann discusses how AI impacts content delivery optimization in this clip from Streaming Media East 2018.
13 Sep 2018
Google's Leonidas Kantothanassis explores the vast range of applications for machine learning in the media workflow and supply change in this clip from his Content Delivery Summit keynote.
19 Feb 2018
Companies and Suppliers Mentioned