AUTHOR=Baird Alice , Schuller Björn TITLE=Considerations for a More Ethical Approach to Data in AI: On Data Representation and Infrastructure JOURNAL=Frontiers in Big Data VOLUME=Volume 3 - 2020 YEAR=2020 URL=https://www.frontiersin.org/journals/big-data/articles/10.3389/fdata.2020.00025 DOI=10.3389/fdata.2020.00025 ISSN=2624-909X ABSTRACT=Data forms the development of Artificial Intelligence (AI) as we currently know it, and for many years centralised networking infrastructures have dominated both the sourcing and subsequent use of such data. Research suggests that centralised approaches result in poor representation, and as AI is now integrated more in daily life, there is a need for efforts to improve on this. The AI research community has now begun to explore managing data infrastructures more democratically, finding that decentralised networking allows for more transparency and therefore can alleviate core ethical concerns such as selection-bias. With this in mind, herein, we present a survey-based overview framed around data representation and data infrastructures in AI. We outline four key considerations (auditing, benchmarking, confidence and trust, explainability and interpretability) as they pertain to data-driven AI, and propose that incorporation of these aspects, along with improved interdisciplinary discussion may aid the mitigation of a plethora of ethical concerns relating to data and AI.