------------ status: published title: MSC slug: munich-security-conference desc: Analyzing the Transnational Flow of Facial Recognition Training Data subdesc: Where does face data originate and who's using it? cssclass: dataset image: assets/background.jpg published: 2019-4-18 updated: 2019-4-19 authors: Adam Harvey ------------ ## Analysis for the Munich Security Conference Transnational Security Report ### sidebar + Images Analyzed: 24,302,637 + Datasets Analyzed: 30 + Years: 2006 - 2018 + Status: Ongoing Investigation + Last Updated: June 27, 2019 ### end sidebar Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum." Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum." === columns 2 ``` single_pie_chart /site/research/munich_security_conference/assets/megapixels_origins_top.csv Caption: Sources of Publicly Available Face Training Data 2006 - 2018 Top: 10 OtherLabel: Other ``` === ``` single_pie_chart /site/research/munich_security_conference/assets/summary_countries.csv Caption: Locations Where Face Data Is Used Top: 14 OtherLabel: Other ``` === end columns === columns 2 #### Sources of Face Data Add text | Source | Images | | --- | --- | |Search Engines | 30,127,200 | |Flickr.com | 11,783,888 | |IMDb.com | 5,251,410 | |CCTV | 959,312 | |Wikimedia.org | 183,500 | |Mugshots | 113,268 | |Other Sources Combined | 37,044 | |YouTube.com | 31,888 | === #### Where Face Data Is Used Add text |country | citations| | --- | --- | |China | 327| |United States | 302| |United Kingdom | 187| |Australia | 38| |Germany | 35| |Singapore | 27| |Canada | 25| |Netherlands | 25| |Italy | 22| |France | 17| |India | 14| |South Korea | 12| |Spain | 10| |Switzerland | 9| === end columns ## Over 6,000 Embassy Images on Flickr Found in Face Recognition Datasets Including over 2,000 more for racial analysis ![caption: MegaFace from U.S. Embassy Canberra](assets/4730007024.jpg) ![caption: An image from the MegaFace dataset obtained from United Kingdom's Embassy in Italy https://flickr.com/photos/ukinitaly](assets/4606260362.jpg) ![caption: An image from the MegaFace dataset obtained from the Flickr account of the United States Embassy in Kabul, Afghanistan https://flickr.com/photos/kabulpublicdiplomacy](assets/4749096858.jpg) === columns 2 ``` single_pie_chart /site/research/munich_security_conference/assets/megapixels_origins_top.csv Caption: Sources of Face Training Data Top: 5 OtherLabel: Other Countries Colors: categoryRainbow ``` =========== ``` single_pie_chart /site/research/munich_security_conference/assets/embassy_counts_summary_dataset.csv Caption: Dataset sources Top: 4 OtherLabel: Other Colors: categoryRainbow ``` === end columns {% include 'supplementary_header.html' %} ``` load_file /site/research/munich_security_conference/assets/embassy_counts_public.csv Headings: Images, Dataset, Embassy, Flickr ID, URL, Guest, Host ``` {% include 'cite_our_work.html' %}