From c07ee775f0ae32fe13d950787c178e84d9bd2bcd Mon Sep 17 00:00:00 2001 From: adamhrv Date: Mon, 11 Mar 2019 00:27:14 +0100 Subject: ars update, preview --- site/public/datasets/lfw/index.html | 1 + 1 file changed, 1 insertion(+) (limited to 'site/public/datasets/lfw/index.html') diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html index 814cd167..4fbd06a5 100644 --- a/site/public/datasets/lfw/index.html +++ b/site/public/datasets/lfw/index.html @@ -37,6 +37,7 @@
  • * denotes partial funding for related research
  • Labeled Faces in the Wild

    +

    (PAGE UNDER DEVELOPMENT)

    Labeled Faces in The Wild (LFW) is "a database of face photographs designed for studying the problem of unconstrained face recognition 1. It is used to evaluate and improve the performance of facial recognition algorithms in academic, commercial, and government research. According to BiometricUpdate.com 3, LFW is "the most widely used evaluation set in the field of facial recognition, LFW attracts a few dozen teams from around the globe including Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong."

    The LFW dataset includes 13,233 images of 5,749 people that were collected between 2002-2004. LFW is a subset of Names of Faces and is part of the first facial recognition training dataset created entirely from images appearing on the Internet. The people appearing in LFW are...

    The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

    -- cgit v1.2.3-70-g09d2 From e0d81dfc33e87c7677c866b602a877899e47e6a5 Mon Sep 17 00:00:00 2001 From: adamhrv Date: Mon, 11 Mar 2019 01:07:25 +0100 Subject: update text, ars --- site/content/pages/about/index.md | 2 +- site/content/pages/datasets/brainwash/index.md | 4 ++-- site/content/pages/datasets/lfw/index.md | 4 ++-- site/content/pages/datasets/mars/index.md | 4 ++-- site/public/about/index.html | 2 +- site/public/datasets/brainwash/index.html | 4 ++-- site/public/datasets/lfw/index.html | 4 ++-- site/public/datasets/mars/index.html | 4 ++-- 8 files changed, 14 insertions(+), 14 deletions(-) (limited to 'site/public/datasets/lfw/index.html') diff --git a/site/content/pages/about/index.md b/site/content/pages/about/index.md index deb4c0e7..9c66fbc4 100644 --- a/site/content/pages/about/index.md +++ b/site/content/pages/about/index.md @@ -27,7 +27,7 @@ authors: Adam Harvey

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    -While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries. +While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies. During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry. diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md index 4812e55d..a01f5bf4 100644 --- a/site/content/pages/datasets/brainwash/index.md +++ b/site/content/pages/datasets/brainwash/index.md @@ -2,8 +2,8 @@ status: published title: Brainwash -desc: Brainwash is a dataset of people from webcams the Brainwash Cafe in San Francisco being used to train face detection algorithms -subdesc: Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" +desc: Brainwash is a dataset of webcam images from the Brainwash Cafe in San Francisco +subdesc: The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms slug: brainwash cssclass: dataset color: #ffaa00 diff --git a/site/content/pages/datasets/lfw/index.md b/site/content/pages/datasets/lfw/index.md index b07c0e4b..b803efc5 100644 --- a/site/content/pages/datasets/lfw/index.md +++ b/site/content/pages/datasets/lfw/index.md @@ -2,8 +2,8 @@ status: published title: Labeled Faces in The Wild -desc: Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition. -subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004. +desc: Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos +subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms. image: assets/background.jpg slug: lfw year: 2007 diff --git a/site/content/pages/datasets/mars/index.md b/site/content/pages/datasets/mars/index.md index 93edaeea..30c9a4d7 100644 --- a/site/content/pages/datasets/mars/index.md +++ b/site/content/pages/datasets/mars/index.md @@ -2,8 +2,8 @@ status: published title: MARS -desc: MARS is a dataset of people... -subdesc: MARS includes... +desc: The Motion Analysis and Re-identification Set (MARS) is a MARS dataset is collection of CCTV footage +subdesc: The MARS dataset includes 1,191,003 of people and is used for training person re-identification algorithms slug: mars cssclass: dataset image: assets/background.jpg diff --git a/site/public/about/index.html b/site/public/about/index.html index 15c4a831..8a95825d 100644 --- a/site/public/about/index.html +++ b/site/public/about/index.html @@ -36,7 +36,7 @@
  • Privacy Policy
  • (PAGE UNDER DEVELOPMENT)

    -

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries.

    +

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.

    During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry.

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and contents. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    The MegaPixels website is produced in partnership with Mozilla who provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    diff --git a/site/public/datasets/brainwash/index.html b/site/public/datasets/brainwash/index.html index 9cf2db0d..ab002c78 100644 --- a/site/public/datasets/brainwash/index.html +++ b/site/public/datasets/brainwash/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Brainwash is a dataset of people from webcams the Brainwash Cafe in San Francisco being used to train face detection algorithms
    Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" +
    Brainwash is a dataset of webcam images from the Brainwash Cafe in San Francisco
    The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms

    Brainwash Dataset

    (PAGE UNDER DEVELOPMENT)

    Brainwash is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe 1". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them.

    diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html index 4fbd06a5..477673e2 100644 --- a/site/public/datasets/lfw/index.html +++ b/site/public/datasets/lfw/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition.
    It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004. +
    Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos
    It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms.