Alice Xiang, Global Head of AI Governance, Sony
involved researchers, engineers, legal and privacy specialists, IT staff and quality assurance teams, not to mention substantial time and money.
“For companies accustomed to rapid iteration, the effort required to build consensually-sourced and globally-representative datasets can feel overwhelming,” she adds.
The second barrier is visual diversity, creating what Alice calls a “perceived trade-off” between ethical sourcing and dataset breadth.
“Consensually-collected datasets often have less visual variety than scraped datasets simply because the web contains such a range of images,” she explains.
Ultimately, Alice believes this trade-off is a question of infrastructure and investment rather than a technical limitation.
As regulation tightens amid shifting public expectations of AI, she is certain the pressure to build responsibly sourced datasets at scale will only intensify.
aimagazine.com