From c07ee775f0ae32fe13d950787c178e84d9bd2bcd Mon Sep 17 00:00:00 2001 From: adamhrv Date: Mon, 11 Mar 2019 00:27:14 +0100 Subject: ars update, preview --- site/content/pages/about/assets/adam-harvey-3d.jpg | Bin 0 -> 9757 bytes site/content/pages/about/assets/adam-harvey-3d.png | Bin 0 -> 100240 bytes site/content/pages/about/assets/adam-harvey.jpg | Bin 18525 -> 5283 bytes .../pages/about/assets/jules-laplace-3d.jpg | Bin 0 -> 10726 bytes site/content/pages/about/assets/jules-laplace.jpg | Bin 15254 -> 6012 bytes site/content/pages/about/disclaimer.md | 4 +--- site/content/pages/about/index.md | 25 +++++++++++++++------ site/content/pages/about/press.md | 2 +- site/content/pages/about/privacy.md | 2 ++ site/content/pages/about/terms.md | 2 ++ .../pages/datasets/50_people_one_question/index.md | 8 ++++--- site/content/pages/datasets/brainwash/index.md | 2 ++ site/content/pages/datasets/celeba/index.md | 4 +++- site/content/pages/datasets/cofw/index.md | 9 ++++---- site/content/pages/datasets/lfw/index.md | 2 ++ site/content/pages/datasets/mars/index.md | 4 +++- 16 files changed, 44 insertions(+), 20 deletions(-) create mode 100644 site/content/pages/about/assets/adam-harvey-3d.jpg create mode 100644 site/content/pages/about/assets/adam-harvey-3d.png create mode 100644 site/content/pages/about/assets/jules-laplace-3d.jpg (limited to 'site/content/pages') diff --git a/site/content/pages/about/assets/adam-harvey-3d.jpg b/site/content/pages/about/assets/adam-harvey-3d.jpg new file mode 100644 index 00000000..2d7cbd80 Binary files /dev/null and b/site/content/pages/about/assets/adam-harvey-3d.jpg differ diff --git a/site/content/pages/about/assets/adam-harvey-3d.png b/site/content/pages/about/assets/adam-harvey-3d.png new file mode 100644 index 00000000..3616e851 Binary files /dev/null and b/site/content/pages/about/assets/adam-harvey-3d.png differ diff --git a/site/content/pages/about/assets/adam-harvey.jpg b/site/content/pages/about/assets/adam-harvey.jpg index e0ab893a..38a484d1 100644 Binary files a/site/content/pages/about/assets/adam-harvey.jpg and b/site/content/pages/about/assets/adam-harvey.jpg differ diff --git a/site/content/pages/about/assets/jules-laplace-3d.jpg b/site/content/pages/about/assets/jules-laplace-3d.jpg new file mode 100644 index 00000000..d51e0933 Binary files /dev/null and b/site/content/pages/about/assets/jules-laplace-3d.jpg differ diff --git a/site/content/pages/about/assets/jules-laplace.jpg b/site/content/pages/about/assets/jules-laplace.jpg index 310b2783..18fc1170 100644 Binary files a/site/content/pages/about/assets/jules-laplace.jpg and b/site/content/pages/about/assets/jules-laplace.jpg differ diff --git a/site/content/pages/about/disclaimer.md b/site/content/pages/about/disclaimer.md index 97edc461..314675ee 100644 --- a/site/content/pages/about/disclaimer.md +++ b/site/content/pages/about/disclaimer.md @@ -23,9 +23,7 @@ authors: Adam Harvey -### Sidebar - -## End Sidebar +(TEMPORARY PAGE) Last updated: December 04, 2018 diff --git a/site/content/pages/about/index.md b/site/content/pages/about/index.md index dbf4e1bb..deb4c0e7 100644 --- a/site/content/pages/about/index.md +++ b/site/content/pages/about/index.md @@ -23,17 +23,28 @@ authors: Adam Harvey -

Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation of their algorithms. Today, these datasets no longer originate in labs, but instead from family photos albums posted on Flickr, CCTV cameras on college campuses, livestreams at cafes, search engine queries for celebrities, or videos on YouTube.

+(PAGE UNDER DEVELOPMENT) -While these datasets include many public figures, politicans, athletes, and actors, they also include many non-public figures including digital activists, students, and pedestrians. Some images are used with creative commons licenses, but others were taken in unconstrained scenarios without anyone's awareness or consent. During the last year hundreds of these datasets have been collected to understand how they contribute to a global supply chain of biometric data. +

Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

-MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel the stories behind these datasets. During 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statisticds, and interactive tools for searching the datasets. +While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries. -This project is produced in partnership with [Mozilla](https://mozilla.org) who has provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies. +During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry. +MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and contents. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets. -## Team +The MegaPixels website is produced in partnership with [Mozilla](https://mozilla.org) who provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies. -![sideimage:Adam Harvey](assets/adam-harvey.jpg) **Adam Harvey** is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and a researcher in residence at Karlsruhe HfG. +The MegaPixels site is based on an [earlier installation](https://ahprojects.com/megapixels-glassroom), also supported by Mozilla, at the [Tactical Tech Glassroom](https://theglassroom.org/) in London about the facial recognition datasets; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets. -![sideimage:Jules LaPlace](assets/jule s-laplace.jpg)**Jules LaPlace** is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians. + +### MegaPixels Team + +![sideimage:Adam Harvey](assets/adam-harvey-3d.jpg) **Adam Harvey** is Berlin-based American artist and researcher. His previous projects ([CV Dazzle](https://ahprojects.com/cvdazzle), [Stealth Wear](https://ahprojects.com/stealth-wear), and [SkyLift](https://ahprojects.com/skylift)) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and is a researcher in residence at Karlsruhe HfG.
[ahprojects.com](https://ahprojects.com) + +![sideimage:Jules LaPlace](assets/jules-laplace-3d.jpg)**Jules LaPlace** is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians.
[asdf.us](https://asdf.us) + + +### Additional Researchers + +Additional research by Berit Gilma. \ No newline at end of file diff --git a/site/content/pages/about/press.md b/site/content/pages/about/press.md index 2e6d3423..759400e8 100644 --- a/site/content/pages/about/press.md +++ b/site/content/pages/about/press.md @@ -23,6 +23,6 @@ authors: Adam Harvey -(list of press articles and images will go here. maybe setup a macro template to use for thumbnail image + press info?) +(TEMPORARY PAGE) - Aug 22, 2018: "Transgender YouTubers had their videos grabbed to train facial recognition software" by James Vincent \ No newline at end of file diff --git a/site/content/pages/about/privacy.md b/site/content/pages/about/privacy.md index 0d908559..b42ccb21 100644 --- a/site/content/pages/about/privacy.md +++ b/site/content/pages/about/privacy.md @@ -23,6 +23,8 @@ authors: Adam Harvey +(TEMPORARY PAGE) + A summary of our privacy policy is as follows: The MegaPixels site does not use any analytics programs or collect any data besides the necessary IP address of your connection, which are deleted every 30 days and used only for security and to prevent misuse. diff --git a/site/content/pages/about/terms.md b/site/content/pages/about/terms.md index 3217e366..40653292 100644 --- a/site/content/pages/about/terms.md +++ b/site/content/pages/about/terms.md @@ -24,6 +24,8 @@ authors: Adam Harvey +(TEMPORARY PAGE) + (FPO: this is only example text) Last updated: December 04, 2018 diff --git a/site/content/pages/datasets/50_people_one_question/index.md b/site/content/pages/datasets/50_people_one_question/index.md index e7dec0aa..2276e386 100644 --- a/site/content/pages/datasets/50_people_one_question/index.md +++ b/site/content/pages/datasets/50_people_one_question/index.md @@ -1,6 +1,6 @@ ------------ -status: published +status: draft title: 50 People One Question desc: People One Question is a dataset of people from an online video series on YouTube and Vimeo used for building facial recogntion algorithms subdesc: People One Question dataset includes ... @@ -23,9 +23,11 @@ authors: Adam Harvey ## 50 People 1 Question - At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio. +(PAGE UNDER DEVELOPMENT) - Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat +At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio. + +Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat {% include 'map.html' %} diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md index 64f7e57d..4812e55d 100644 --- a/site/content/pages/datasets/brainwash/index.md +++ b/site/content/pages/datasets/brainwash/index.md @@ -33,6 +33,8 @@ authors: Adam Harvey ## Brainwash Dataset +(PAGE UNDER DEVELOPMENT) + *Brainwash* is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe[^readme]". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them. Before closing in 2017, Brainwash Cafe was a "cafe and laundromat" located in San Francisco's SoMA district. The cafe published a publicy available livestream from the cafe with a view of the cash register, performance stage, and seating area. diff --git a/site/content/pages/datasets/celeba/index.md b/site/content/pages/datasets/celeba/index.md index 19b0291d..a2669cf6 100644 --- a/site/content/pages/datasets/celeba/index.md +++ b/site/content/pages/datasets/celeba/index.md @@ -1,6 +1,6 @@ ------------ -status: published +status: draft title: CelebA desc: CelebA is a dataset of people... subdesc: CelebA includes... @@ -23,6 +23,8 @@ authors: Adam Harvey ## CelebA +(PAGE UNDER DEVELOPMENT) + At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio. Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat diff --git a/site/content/pages/datasets/cofw/index.md b/site/content/pages/datasets/cofw/index.md index 3b1cdb2b..d017f405 100644 --- a/site/content/pages/datasets/cofw/index.md +++ b/site/content/pages/datasets/cofw/index.md @@ -1,6 +1,6 @@ ------------ -status: published +status: draft title: Caltech Occluded Faces in The Wild desc: COFW: Caltech Occluded Faces in The Wild slug: cofw @@ -10,7 +10,8 @@ authors: Adam Harvey ------------ -# Caltech Occluded Faces in The Wild + +### sidebar + Years: 1993-1996 + Images: 14,126 @@ -19,9 +20,9 @@ authors: Adam Harvey + Funded by: ODNI, IARPA, Microsoft - +## Caltech Occluded Faces in the Wild -![](assets/cofw_index.gif) +(PAGE UNDER DEVELOPMENT) COFW is "is designed to benchmark face landmark algorithms in realistic conditions, which include heavy occlusions and large shape variations" [Robust face landmark estimation under occlusion]. diff --git a/site/content/pages/datasets/lfw/index.md b/site/content/pages/datasets/lfw/index.md index cb326f0b..b07c0e4b 100644 --- a/site/content/pages/datasets/lfw/index.md +++ b/site/content/pages/datasets/lfw/index.md @@ -33,6 +33,8 @@ authors: Adam Harvey ## Labeled Faces in the Wild +(PAGE UNDER DEVELOPMENT) + *Labeled Faces in The Wild* (LFW) is "a database of face photographs designed for studying the problem of unconstrained face recognition[^lfw_www]. It is used to evaluate and improve the performance of facial recognition algorithms in academic, commercial, and government research. According to BiometricUpdate.com[^lfw_pingan], LFW is "the most widely used evaluation set in the field of facial recognition, LFW attracts a few dozen teams from around the globe including Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong." The LFW dataset includes 13,233 images of 5,749 people that were collected between 2002-2004. LFW is a subset of *Names of Faces* and is part of the first facial recognition training dataset created entirely from images appearing on the Internet. The people appearing in LFW are... diff --git a/site/content/pages/datasets/mars/index.md b/site/content/pages/datasets/mars/index.md index 19f9ced4..93edaeea 100644 --- a/site/content/pages/datasets/mars/index.md +++ b/site/content/pages/datasets/mars/index.md @@ -21,7 +21,9 @@ authors: Adam Harvey + Faces: TBD -## 50 MARS +## MARS + +(PAGE UNDER DEVELOPMENT) At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio. -- cgit v1.2.3-70-g09d2 From e0d81dfc33e87c7677c866b602a877899e47e6a5 Mon Sep 17 00:00:00 2001 From: adamhrv Date: Mon, 11 Mar 2019 01:07:25 +0100 Subject: update text, ars --- site/content/pages/about/index.md | 2 +- site/content/pages/datasets/brainwash/index.md | 4 ++-- site/content/pages/datasets/lfw/index.md | 4 ++-- site/content/pages/datasets/mars/index.md | 4 ++-- site/public/about/index.html | 2 +- site/public/datasets/brainwash/index.html | 4 ++-- site/public/datasets/lfw/index.html | 4 ++-- site/public/datasets/mars/index.html | 4 ++-- 8 files changed, 14 insertions(+), 14 deletions(-) (limited to 'site/content/pages') diff --git a/site/content/pages/about/index.md b/site/content/pages/about/index.md index deb4c0e7..9c66fbc4 100644 --- a/site/content/pages/about/index.md +++ b/site/content/pages/about/index.md @@ -27,7 +27,7 @@ authors: Adam Harvey

Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

-While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries. +While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies. During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry. diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md index 4812e55d..a01f5bf4 100644 --- a/site/content/pages/datasets/brainwash/index.md +++ b/site/content/pages/datasets/brainwash/index.md @@ -2,8 +2,8 @@ status: published title: Brainwash -desc: Brainwash is a dataset of people from webcams the Brainwash Cafe in San Francisco being used to train face detection algorithms -subdesc: Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" +desc: Brainwash is a dataset of webcam images from the Brainwash Cafe in San Francisco +subdesc: The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms slug: brainwash cssclass: dataset color: #ffaa00 diff --git a/site/content/pages/datasets/lfw/index.md b/site/content/pages/datasets/lfw/index.md index b07c0e4b..b803efc5 100644 --- a/site/content/pages/datasets/lfw/index.md +++ b/site/content/pages/datasets/lfw/index.md @@ -2,8 +2,8 @@ status: published title: Labeled Faces in The Wild -desc: Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition. -subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004. +desc: Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos +subdesc: It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms. image: assets/background.jpg slug: lfw year: 2007 diff --git a/site/content/pages/datasets/mars/index.md b/site/content/pages/datasets/mars/index.md index 93edaeea..30c9a4d7 100644 --- a/site/content/pages/datasets/mars/index.md +++ b/site/content/pages/datasets/mars/index.md @@ -2,8 +2,8 @@ status: published title: MARS -desc: MARS is a dataset of people... -subdesc: MARS includes... +desc: The Motion Analysis and Re-identification Set (MARS) is a MARS dataset is collection of CCTV footage +subdesc: The MARS dataset includes 1,191,003 of people and is used for training person re-identification algorithms slug: mars cssclass: dataset image: assets/background.jpg diff --git a/site/public/about/index.html b/site/public/about/index.html index 15c4a831..8a95825d 100644 --- a/site/public/about/index.html +++ b/site/public/about/index.html @@ -36,7 +36,7 @@
  • Privacy Policy
  • (PAGE UNDER DEVELOPMENT)

    -

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicans, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research studies, but when examined further it becomes clear that they're also used by defense contractors in foreign countries.

    +

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.

    During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry.

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and contents. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    The MegaPixels website is produced in partnership with Mozilla who provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    diff --git a/site/public/datasets/brainwash/index.html b/site/public/datasets/brainwash/index.html index 9cf2db0d..ab002c78 100644 --- a/site/public/datasets/brainwash/index.html +++ b/site/public/datasets/brainwash/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Brainwash is a dataset of people from webcams the Brainwash Cafe in San Francisco being used to train face detection algorithms
    Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" +
    Brainwash is a dataset of webcam images from the Brainwash Cafe in San Francisco
    The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms

    Brainwash Dataset

    (PAGE UNDER DEVELOPMENT)

    Brainwash is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe 1". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them.

    diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html index 4fbd06a5..477673e2 100644 --- a/site/public/datasets/lfw/index.html +++ b/site/public/datasets/lfw/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Labeled Faces in The Wild (LFW) is a database of face photographs designed for studying the problem of unconstrained face recognition.
    It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004. +
    Labeled Faces in The Wild (LFW) is the first facial recognition dataset created entirely from online photos
    It includes 13,456 images of 4,432 people's images copied from the Internet during 2002-2004 and is the most frequently used dataset in the world for benchmarking face recognition algorithms.

    (PAGE UNDER DEVELOPMENT)

    -

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on social media sites, CCTV camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.

    -

    During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is helping to power the global facial recognition industry.

    -

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and contents. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    -

    The MegaPixels website is produced in partnership with Mozilla who provided the funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    -

    The MegaPixels site is based on an earlier installation, also supported by Mozilla, at the Tactical Tech Glassroom in London about the facial recognition datasets; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets.

    +

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on photo sharing sites, surveillance camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.

    +

    During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is powering the global facial recognition industry.

    +

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and context. Throughout 2019 this site, coded and designed by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    +

    The MegaPixels website is produced in partnership with Mozilla who provided funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    +

    The MegaPixels site is based on an earlier installation (also supported by Mozilla) at the Tactical Tech Glassroom in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during CV Dazzle from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellecture property, surveillance, and privacy.

    MegaPixels Team

    Adam Harvey

    Adam Harvey is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and is a researcher in residence at Karlsruhe HfG.
    ahprojects.com

    Jules LaPlace

    Jules LaPlace is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians.
    asdf.us

    diff --git a/site/public/datasets/mars/index.html b/site/public/datasets/mars/index.html index 37e4446a..bfad52a3 100644 --- a/site/public/datasets/mars/index.html +++ b/site/public/datasets/mars/index.html @@ -26,8 +26,8 @@
    -
    The Motion Analysis and Re-identification Set (MARS) is a dataset is collection of CCTV footage
    The MARS dataset includes 1,191,003 of people and is used for training person re-identification algorithms -

    MARS

    +
    The Motion Analysis and Re-identification Set (MARS) is a dataset is collection of CCTV footage
    The MARS dataset includes 1,191,003 of people used for training person re-identification algorithms +

    Motion Analysis and Re-identification Set (MARS)

    (PAGE UNDER DEVELOPMENT)

    At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio.

    Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat

    -- cgit v1.2.3-70-g09d2 From 092d47ca0a005e5700bee86ec72e2206acdc5fc1 Mon Sep 17 00:00:00 2001 From: adamhrv Date: Mon, 11 Mar 2019 12:08:49 +0100 Subject: text edits --- site/content/pages/about/index.md | 8 +++++--- site/public/about/index.html | 3 ++- 2 files changed, 7 insertions(+), 4 deletions(-) (limited to 'site/content/pages') diff --git a/site/content/pages/about/index.md b/site/content/pages/about/index.md index daeb6dd5..5738d5ce 100644 --- a/site/content/pages/about/index.md +++ b/site/content/pages/about/index.md @@ -35,8 +35,6 @@ MegaPixels is art and research by Adam HarveyDuring the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is powering the global facial recognition industry.

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and context. Throughout 2019 this site, coded and designed by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    The MegaPixels website is produced in partnership with Mozilla who provided funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    -

    The MegaPixels site is based on an earlier installation (also supported by Mozilla) at the Tactical Tech Glassroom in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during CV Dazzle from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellecture property, surveillance, and privacy.

    MegaPixels Team

    Adam Harvey

    Adam Harvey is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and is a researcher in residence at Karlsruhe HfG.
    ahprojects.com

    Jules LaPlace

    Jules LaPlace is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians.
    asdf.us

    Additional Researchers

    Additional research by Berit Gilma.

    +

    Project History

    +

    The MegaPixels site is based on an earlier installation (also supported by Mozilla) at the Tactical Tech Glassroom in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during CV Dazzle from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellectual property, surveillance, and privacy.

    -- cgit v1.2.3-70-g09d2 From 1fe0e5c79c3cbc4b13083116980e62b449866100 Mon Sep 17 00:00:00 2001 From: adamhrv Date: Tue, 12 Mar 2019 01:18:12 +0100 Subject: add credits, 2col about --- site/assets/css/css.css | 40 ++++++++++++ site/content/pages/about/credits.md | 43 ++++++++++++ site/content/pages/about/disclaimer.md | 1 + site/content/pages/about/index.md | 48 +++++++------- site/content/pages/about/press.md | 1 + site/content/pages/about/privacy.md | 1 + site/content/pages/about/terms.md | 1 + .../pages/research/00_introduction/index.md | 16 +++++ site/public/about/credits/index.html | 76 ++++++++++++++++++++++ site/public/about/disclaimer/index.html | 1 + site/public/about/index.html | 32 +++++---- site/public/about/press/index.html | 1 + site/public/about/privacy/index.html | 1 + site/public/about/terms/index.html | 1 + site/public/research/00_introduction/index.html | 12 +++- 15 files changed, 237 insertions(+), 38 deletions(-) create mode 100644 site/content/pages/about/credits.md create mode 100644 site/public/about/credits/index.html (limited to 'site/content/pages') diff --git a/site/assets/css/css.css b/site/assets/css/css.css index a0f9519b..69302409 100644 --- a/site/assets/css/css.css +++ b/site/assets/css/css.css @@ -481,6 +481,46 @@ section.fullwidth .image { color:#fff; border: 0; } + + +/* about page */ + +.flex-container { + padding: 0; + margin: 0; + list-style: none; + + display: -webkit-box; + display: -moz-box; + display: -ms-flexbox; + display: -webkit-flex; + display: flex; + + -webkit-flex-flow: row wrap; + justify-content: space-around; +} + +.team-member { + height: auto; + margin-top: 10px; + color: white; + width: 400px; + font-weight: bold; + flex-grow: 1; + margin:0 40px 0 0; +} +.team-member&:last-child{ + margin:0 0 0 40px; +} +.team-member p{ + font-size:14px; +} +.team-member img{ + margin:0; + display: block; +} + + .sideimage { margin: 0px 0 40px 0; display: flex; diff --git a/site/content/pages/about/credits.md b/site/content/pages/about/credits.md new file mode 100644 index 00000000..3ad962df --- /dev/null +++ b/site/content/pages/about/credits.md @@ -0,0 +1,43 @@ +------------ + +status: published +title: MegaPixels Press and News +desc: MegaPixels Press and News +slug: press +cssclass: about +published: 2018-12-04 +updated: 2018-12-04 +authors: Adam Harvey + +------------ + +# Credits + +
    + +
    + + +#### Team + +- Research, concept: Adam Harvey +- Site development, visualizations: Jules LaPlace +- Assistant researcher: Berit Gilma +- Produced in Partnership with Mozilla + +#### Code + +- This site uses D3 and C2 for visuzations. +- Add more here + +#### Data + +- link to github +- how it was gathered \ No newline at end of file diff --git a/site/content/pages/about/disclaimer.md b/site/content/pages/about/disclaimer.md index 314675ee..f82a09a0 100644 --- a/site/content/pages/about/disclaimer.md +++ b/site/content/pages/about/disclaimer.md @@ -17,6 +17,7 @@ authors: Adam Harvey
    • About
    • Press
    • +
    • Credits
    • Disclaimer
    • Terms and Conditions
    • Privacy Policy
    • diff --git a/site/content/pages/about/index.md b/site/content/pages/about/index.md index 5738d5ce..b1b7a80f 100644 --- a/site/content/pages/about/index.md +++ b/site/content/pages/about/index.md @@ -17,6 +17,7 @@ authors: Adam Harvey
      • About
      • Press
      • +
      • Credits
      • Disclaimer
      • Terms and Conditions
      • Privacy Policy
      • @@ -25,28 +26,25 @@ authors: Adam Harvey (PAGE UNDER DEVELOPMENT) -

        Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on photo sharing sites, surveillance camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

        - -While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies. - -During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is powering the global facial recognition industry. - -MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and context. Throughout 2019 this site, coded and designed by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets. - -The MegaPixels website is produced in partnership with [Mozilla](https://mozilla.org) who provided funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies. - - -### MegaPixels Team - -![sideimage:Adam Harvey](assets/adam-harvey-3d.jpg) **Adam Harvey** is Berlin-based American artist and researcher. His previous projects ([CV Dazzle](https://ahprojects.com/cvdazzle), [Stealth Wear](https://ahprojects.com/stealth-wear), and [SkyLift](https://ahprojects.com/skylift)) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and is a researcher in residence at Karlsruhe HfG.
        [ahprojects.com](https://ahprojects.com) - -![sideimage:Jules LaPlace](assets/jules-laplace-3d.jpg)**Jules LaPlace** is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians.
        [asdf.us](https://asdf.us) - - -### Additional Researchers - -Additional research by Berit Gilma. - -### Project History - -The MegaPixels site is based on an earlier [installation](https://ahprojects.com/megapixels-glassroom) (also supported by Mozilla) at the [Tactical Tech Glassroom](https://theglassroom.org/) in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during [CV Dazzle](https://cvdazzle.com) from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellectual property, surveillance, and privacy. \ No newline at end of file +

        MegaPixels is art and research by Adam Harvey about facial recognition datasets that aims to unravel their histories, futures, geographies, and meanings. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, raw data, and interactive tools to explore how publicly available facial recognition datasets contribute to a global supply chain of biometric data that powers the global facial recognition industry.

        + +The MegaPixels website is produced in partnership with [Mozilla](https://mozilla.org). + +
        +
        + +

        Adam Harvey

        +

        is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund grants, and is a researcher in residence at Karlsruhe HfG. +
        + ahprojects.com +

        +
        +
        + +

        Jules LaPlace

        +

        is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians. +
        + https://asdf.us +

        +
        +
        diff --git a/site/content/pages/about/press.md b/site/content/pages/about/press.md index 759400e8..47e1af52 100644 --- a/site/content/pages/about/press.md +++ b/site/content/pages/about/press.md @@ -17,6 +17,7 @@ authors: Adam Harvey
        • About
        • Press
        • +
        • Credits
        • Disclaimer
        • Terms and Conditions
        • Privacy Policy
        • diff --git a/site/content/pages/about/privacy.md b/site/content/pages/about/privacy.md index b42ccb21..e36daf2a 100644 --- a/site/content/pages/about/privacy.md +++ b/site/content/pages/about/privacy.md @@ -17,6 +17,7 @@ authors: Adam Harvey
          • About
          • Press
          • +
          • Credits
          • Disclaimer
          • Terms and Conditions
          • Privacy Policy
          • diff --git a/site/content/pages/about/terms.md b/site/content/pages/about/terms.md index 40653292..7ae6dac7 100644 --- a/site/content/pages/about/terms.md +++ b/site/content/pages/about/terms.md @@ -18,6 +18,7 @@ authors: Adam Harvey
            • About
            • Press
            • +
            • Credits
            • Disclaimer
            • Terms and Conditions
            • Privacy Policy
            • diff --git a/site/content/pages/research/00_introduction/index.md b/site/content/pages/research/00_introduction/index.md index 9555ca9b..bcb3d57c 100644 --- a/site/content/pages/research/00_introduction/index.md +++ b/site/content/pages/research/00_introduction/index.md @@ -16,6 +16,22 @@ authors: Megapixels + Author: Adam Harvey + +### Motivation + +Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to developing and validating face recognition technologies. Today, these datasets no longer originate in labs, but instead from family photo albums posted on photo sharing sites, surveillance camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube. + +During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is powering the global facial recognition industry. + +While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures: digital activists, students, pedestrians, and semi-private shared photo albums are all considered "in the wild" and fair game for research projects. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies. + +The MegaPixels site is based on an earlier [installation](https://ahprojects.com/megapixels-glassroom) (also supported by Mozilla) at the [Tactical Tech Glassroom](https://theglassroom.org/) in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during [CV Dazzle](https://cvdazzle.com) from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellectual property, surveillance, and privacy. + + + + + + Ever since the first computational facial recognition research project by the CIA in the early 1960s, data has always played a vital role in the development of our biometric future. Without facial recognition datasets there would be no facial recognition. Datasets are an indispensable part of any artificial intelligence system because, as Geoffrey Hinton points out: > Our relationship to computers has changed. Instead of programming them, we now show them and they figure it out. - [Geoffrey Hinton](https://www.youtube.com/watch?v=-eyhCTvrEtE) diff --git a/site/public/about/credits/index.html b/site/public/about/credits/index.html new file mode 100644 index 00000000..b7ab8085 --- /dev/null +++ b/site/public/about/credits/index.html @@ -0,0 +1,76 @@ + + + + MegaPixels + + + + + + + + + + + + +
              + + +
              MegaPixels
              +
              + +
              +
              + +

              Credits

              +
              + +

              Team

              +
                +
              • Research, concept: Adam Harvey
              • +
              • Site development, visualizations: Jules LaPlace
              • +
              • Assistant researcher: Berit Gilma
              • +
              • Produced in Partnership with Mozilla
              • +
              +

              Code

              +
                +
              • This site uses D3 and C2 for visuzations.
              • +
              • Add more here
              • +
              +

              Data

              +
                +
              • link to github
              • +
              • how it was gathered
              • +
              +
              + +
              + + + + + \ No newline at end of file diff --git a/site/public/about/disclaimer/index.html b/site/public/about/disclaimer/index.html index 25281a16..28588708 100644 --- a/site/public/about/disclaimer/index.html +++ b/site/public/about/disclaimer/index.html @@ -31,6 +31,7 @@

    (PAGE UNDER DEVELOPMENT)

    -

    Ever since government agencies began developing face recognition in the early 1960's, datasets of face images have always been central to the development and evaluation face recognition technology. Today, these datasets no longer originate in labs, but instead from family photo albums posted on photo sharing sites, surveillance camera footage from college campuses, search engine queries for celebrities, cafe livestreams, or videos on YouTube.

    While many of these datasets include public figures such as politicians, athletes, and actors; they also include many non-public figures including digital activists, students, pedestrians, and people's semi-private shared photo albums. Some images are used with creative commons licenses, yet others were taken in unconstrained scenarios without awareness or consent. At first glance it appears many of the datasets were created for seemingly harmless academic research, but when examined further it becomes clear that they're also used by foreign defense agencies.

    -

    During the last year, hundreds of these facial analysis datasets created "in the wild" have been collected to understand how they contribute to a global supply chain of biometric data that is powering the global facial recognition industry.

    -

    MegaPixels is art and research by Adam Harvey about publicly available facial recognition datasets that aims to unravel their histories, futures, geographies, and context. Throughout 2019 this site, coded and designed by Jules LaPlace, will publish research reports, visualizations, downloadable statistics, and interactive tools for searching the datasets.

    -

    The MegaPixels website is produced in partnership with Mozilla who provided funding to research the datasets, build the site, and develop tools to help you understand the role these datasets have played in creating biometric surveillance technologies.

    -

    MegaPixels Team

    -
    Adam Harvey

    Adam Harvey is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund awards, and is a researcher in residence at Karlsruhe HfG.
    ahprojects.com

    -
    Jules LaPlace

    Jules LaPlace is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians.
    asdf.us

    -

    Additional Researchers

    -

    Additional research by Berit Gilma.

    -

    Project History

    -

    The MegaPixels site is based on an earlier installation (also supported by Mozilla) at the Tactical Tech Glassroom in London in 2017; and a commission from the Elevate arts festival curated by Berit Gilma about pedestrian recognition datasets in 2018, and research during CV Dazzle from 2010-2015. Through the many prototypes, conversations, pitches, PDFs, and false starts this project has endured during the last 5 years, it eventually evolved into something much different than originally imagined. Now, as datasets become increasingly influential in shaping the computational future, it's clear that they must be critically analyzed to understand the biases, shortcomings, funding sources, and contributions to the surveillance industry. However, it's misguided to only criticize these datasets for their flaws without also praising their contribution to society. Without publicly available facial analysis datasets there would be less public discourse, less open-source software, and less peer-reviewed research. Public datasets can indeed become a vital public good for the information economy but as this projects aims to illustrate, many ethical questions arise about consent, intellectual property, surveillance, and privacy.

    -
    +

    MegaPixels is art and research by Adam Harvey about facial recognition datasets that aims to unravel their histories, futures, geographies, and meanings. Throughout 2019 this site, coded by Jules LaPlace, will publish research reports, visualizations, raw data, and interactive tools to explore how publicly available facial recognition datasets contribute to a global supply chain of biometric data that powers the global facial recognition industry.

    The MegaPixels website is produced in partnership with Mozilla.

    +
    +
    + +

    Adam Harvey

    +

    is Berlin-based American artist and researcher. His previous projects (CV Dazzle, Stealth Wear, and SkyLift) explore the potential for countersurveillance as artwork. He is the founder of VFRAME (visual forensics software for human rights groups), the recipient of 2 PrototypeFund grants, and is a researcher in residence at Karlsruhe HfG. +
    + ahprojects.com +

    +
    +
    + +

    Jules LaPlace

    +

    is an American creative technologist also based in Berlin. He was previously the CTO of a digital agency in NYC and now also works at VFRAME, developing computer vision for human rights groups. Jules also builds creative software for artists and musicians. +
    + https://asdf.us +

    +
    +