MegaFace is a dataset of 4,700,000 face images of 672,000 individuals used for developing face recognition technologies. All images were downloaded from Flickr.
MegaFace was developed by the University of Washington for the purpose of trainng, validating, and benchmarking face recognition algorithms.
The images are from Flickr, but are they all from YFCC100M?
MegaFace was used for research projects associated with SenseTime, Google, Mitsubishi, Vision Semantics Ltd, Microsoft.
MegaFace was also used for MegaFace Asian, and MegaAge, and glasses.
Used for face recognition
screenshots of papers
This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries.
To help understand how MegaFace Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing MegaFace Dataset was collected, verified, and geocoded to show how AI training data has proliferated around the world. Click on the markers to reveal research projects at that location.
The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to trainĀ or test machine learning algorithms. If you use our data, please cite our work.
Age and gender estimation distribution were calculated by anlayzing all faces in the dataset images. This may include additional faces appearing next to an annotated face, or this may skip false faces that were erroneously included as part of the original dataset. These numbers are provided as an estimation and not a factual representation of the exact gender and age of all faces.
If you find this analysis helpful, please cite our work:
@online{megapixels,
author = {Harvey, Adam. LaPlace, Jules.},
title = {MegaPixels: Origins, Ethics, and Privacy Implications of Publicly Available Face Recognition Image Datasets},
year = 2019,
url = {https://megapixels.cc/},
urldate = {2019-04-18}
}