From c07ee775f0ae32fe13d950787c178e84d9bd2bcd Mon Sep 17 00:00:00 2001
From: adamhrv
Date: Mon, 11 Mar 2019 00:27:14 +0100
Subject: ars update, preview
---
site/public/datasets/lfw/index.html | 1 +
1 file changed, 1 insertion(+)
(limited to 'site/public/datasets/lfw/index.html')
diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html
index 814cd167..4fbd06a5 100644
--- a/site/public/datasets/lfw/index.html
+++ b/site/public/datasets/lfw/index.html
@@ -37,6 +37,7 @@
* denotes partial funding for related research
Labeled Faces in the Wild
+
(PAGE UNDER DEVELOPMENT)
Labeled Faces in The Wild (LFW) is "a database of face photographs designed for studying the problem of unconstrained face recognition1. It is used to evaluate and improve the performance of facial recognition algorithms in academic, commercial, and government research. According to BiometricUpdate.com3, LFW is "the most widely used evaluation set in the field of facial recognition, LFW attracts a few dozen teams from around the globe including Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong."
The LFW dataset includes 13,233 images of 5,749 people that were collected between 2002-2004. LFW is a subset of Names of Faces and is part of the first facial recognition training dataset created entirely from images appearing on the Internet. The people appearing in LFW are...
The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.
Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.
-While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries.
+While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.
During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry.
diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md
index 4812e55d..a01f5bf4 100644
--- a/site/content/pages/datasets/brainwash/index.md
+++ b/site/content/pages/datasets/brainwash/index.md
@@ -2,8 +2,8 @@
status: published
title: Brainwash
-desc: Brainwash is a dataset of people from webcams the Brainwash Cafe in San Francisco being used to train face detection algorithms
-subdesc: Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe"
+desc: Brainwash is a dataset of webcam images from the Brainwash Cafe in San Francisco
+subdesc: The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms
slug: brainwash
cssclass: dataset
color: #ffaa00
diff --git a/site/content/pages/datasets/lfw/index.md b/site/content/pages/datasets/lfw/index.md
index b07c0e4b..b803efc5 100644
--- a/site/content/pages/datasets/lfw/index.md
+++ b/site/content/pages/datasets/lfw/index.md
@@ -2,8 +2,8 @@
status: published
title: Labeled Faces in The Wild
-desc: Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition.
-subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004.
+desc: Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos
+subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms.
image: assets/background.jpg
slug: lfw
year: 2007
diff --git a/site/content/pages/datasets/mars/index.md b/site/content/pages/datasets/mars/index.md
index 93edaeea..30c9a4d7 100644
--- a/site/content/pages/datasets/mars/index.md
+++ b/site/content/pages/datasets/mars/index.md
@@ -2,8 +2,8 @@
status: published
title: MARS
-desc: MARS is a dataset of people...
-subdesc: MARS includes...
+desc: The Motion Analysis and Re-identification Set (MARS) is a MARS dataset is collection of CCTV footage
+subdesc: The MARS dataset includes 1,191,003 of people and is used for training person re-identification algorithms
slug: mars
cssclass: dataset
image: assets/background.jpg
diff --git a/site/public/about/index.html b/site/public/about/index.html
index 15c4a831..8a95825d 100644
--- a/site/public/about/index.html
+++ b/site/public/about/index.html
@@ -36,7 +36,7 @@
Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.
While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries.
+
Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.
While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.
During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry.
MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and contents. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.
The MegaPixels website is produced in partnership with Mozilla who provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.
Brainwash is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe1". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them.
Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition.
It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004.
+
Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos
It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms.
The Motion Analysis and Re-identification Set (MARS) is a MARS dataset is collection of CCTV footage
The MARS dataset includes 1,191,003 of people and is used for training person re-identification algorithms
Collected
TBD
Published
TBD
Images
TBD
Faces
TBD
MARS
(PAGE UNDER DEVELOPMENT)
At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio.