From b73e233acec5ad6c3aca7475288482f366f7a31f Mon Sep 17 00:00:00 2001 From: adamhrv Date: Fri, 5 Apr 2019 13:17:05 +0200 Subject: never say final, update uccs --- .../datasets/50_people_one_question/index.html | 65 ++-- site/public/datasets/afad/index.html | 80 ++++- site/public/datasets/aflw/index.html | 7 +- site/public/datasets/brainwash/index.html | 62 ++-- site/public/datasets/caltech_10k/index.html | 88 ++++- site/public/datasets/celeba/index.html | 65 ++-- site/public/datasets/cofw/index.html | 95 ++++-- site/public/datasets/duke_mtmc/index.html | 159 +++------ site/public/datasets/facebook/index.html | 7 +- site/public/datasets/feret/index.html | 38 ++- site/public/datasets/hrt_transgender/index.html | 81 +---- site/public/datasets/index.html | 31 +- site/public/datasets/lfpw/index.html | 71 +++- site/public/datasets/lfw/index.html | 122 +++---- site/public/datasets/market_1501/index.html | 80 ++--- site/public/datasets/msceleb/index.html | 62 ++-- site/public/datasets/oxford_town_centre/index.html | 150 +++++++++ site/public/datasets/pipa/index.html | 68 ++-- site/public/datasets/pubfig/index.html | 117 +++++++ site/public/datasets/uccs/index.html | 364 ++++++++------------- site/public/datasets/vgg_face2/index.html | 89 ++++- site/public/datasets/viper/index.html | 50 +-- .../public/datasets/youtube_celebrities/index.html | 75 ++++- 23 files changed, 1167 insertions(+), 859 deletions(-) create mode 100644 site/public/datasets/oxford_town_centre/index.html create mode 100644 site/public/datasets/pubfig/index.html (limited to 'site/public/datasets') diff --git a/site/public/datasets/50_people_one_question/index.html b/site/public/datasets/50_people_one_question/index.html index 796af8d6..b27fa3e5 100644 --- a/site/public/datasets/50_people_one_question/index.html +++ b/site/public/datasets/50_people_one_question/index.html @@ -17,7 +17,7 @@
MegaPixels
-
50 People One Question
+
50 People One Question Dataset
Website
caltech.edu
-
Collected
TBD
Published
TBD
Images
TBD
Faces
TBD

50 People 1 Question

-

(PAGE UNDER DEVELOPMENT)

-

At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio.

-

Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat

+

50 People 1 Question

+

[ page under development ]

+

Who used 50 People One Question Dataset?

+ +

+ This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

+ +
+ +
+ +
+
+ +
+
+
+ +

Biometric Trade Routes

- +

- To help understand how 50 People One Question Dataset has been used around the world for commercial, military and academic research; publicly available research citing 50 People One Question is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how 50 People One Question Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing 50 People One Question was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

-
@@ -72,25 +79,12 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    - -
    -
    -
    -
    -

    Supplementary Information

    - -
    +

    Dataset Citations

    @@ -104,11 +98,10 @@

    MegaPixels.cc - Disclaimer - Terms of Use - Privacy + Datasets About - Team + Press + Legal and Privacy
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/afad/index.html b/site/public/datasets/afad/index.html index ac025a80..67a4e981 100644 --- a/site/public/datasets/afad/index.html +++ b/site/public/datasets/afad/index.html @@ -17,7 +17,7 @@
    MegaPixels
    -
    AFAD
    +
    Asian Face Age Dataset
     An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
    An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
     49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)
    49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)

    Additional Resources

    - -

    TODO

    +
     An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
    An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
     49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)
    49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)

    TODO

      +
    • include the images referenced in the chinese defence papers?
    • +
    • change supp images to 2x2 grid with bboxes
    • add bounding boxes to the header image
    • remake montage with randomized images, with bboxes
    • -
    • clean up intro text
    • -
    • verify quote citations
    • a

      "readme.txt" https://exhibits.stanford.edu/data/catalog/sx925dc9385.

      +
    • a

      Stewart, Russel. Andriluka, Mykhaylo. "End-to-end people detection in crowded scenes". 2016.

    • a

      Li, Y. and Dou, Y. and Liu, X. and Li, T. Localized Region Context and Object Feature Fusion for People Head Detection. ICIP16 Proceedings. 2016. Pages 594-598.

    • a

      Zhao. X, Wang Y, Dou, Y. A Replacement Algorithm of Non-Maximum Suppression Base on Graph Clustering.

    @@ -142,11 +129,10 @@
    MegaPixels.cc - Disclaimer - Terms of Use - Privacy + Datasets About - Team + Press + Legal and Privacy
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/caltech_10k/index.html b/site/public/datasets/caltech_10k/index.html index 9aa0b2c3..10925b09 100644 --- a/site/public/datasets/caltech_10k/index.html +++ b/site/public/datasets/caltech_10k/index.html @@ -17,7 +17,7 @@
    MegaPixels
    - +
    Brainwash Dataset

    CelebA Dataset

    +

    [ PAGE UNDER DEVELOPMENT ]

    +

    Who used CelebA Dataset?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how CelebA Dataset has been used around the world for commercial, military and academic research; publicly available research citing Large-scale CelebFaces Attributes Dataset is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how CelebA Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Large-scale CelebFaces Attributes Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -78,25 +85,12 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    - -
    -
    -
    -
    -

    Supplementary Information

    - -
    +

    Dataset Citations

    @@ -116,11 +110,10 @@

    MegaPixels.cc - Disclaimer - Terms of Use - Privacy + Datasets About - Team + Press + Legal and Privacy
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/cofw/index.html b/site/public/datasets/cofw/index.html index 8925d4b8..72f222c9 100644 --- a/site/public/datasets/cofw/index.html +++ b/site/public/datasets/cofw/index.html @@ -17,7 +17,7 @@
    MegaPixels
    -
    COFW
    +
    COFW Dataset
    Website
    -
    Years
    1993-1996
    Images
    14,126
    Identities
    1,199
    Origin
    Web Searches
    Funded by
    ODNI, IARPA, Microsoft

    Caltech Occluded Faces in the Wild

    -

    (PAGE UNDER DEVELOPMENT)

    -

    COFW is "is designed to benchmark face landmark algorithms in realistic conditions, which include heavy occlusions and large shape variations" [Robust face landmark estimation under occlusion].

    -

    RESEARCH below this line

    +

    Caltech Occluded Faces in the Wild

    +

    [ PAGE UNDER DEVELOPMENT ]

    +
    +

    Who used COFW Dataset?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +
    + +

    Biometric Trade Routes

    + +

    + To help understand how COFW Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Caltech Occluded Faces in the Wild was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location. +

    + +
    + +
    +
    +
    + +
    +
      +
    • Academic
    • +
    • Commercial
    • +
    • Military / Government
    • +
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    +
    + + +
    + +

    Dataset Citations

    +

    + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. +

    + +
    +

    (ignore) research notes

    +
    Years
    1993-1996
    Images
    14,126
    Identities
    1,199
    Origin
    Web Searches
    Funded by
    ODNI, IARPA, Microsoft

    COFW is "is designed to benchmark face landmark algorithms in realistic conditions, which include heavy occlusions and large shape variations" [Robust face landmark estimation under occlusion].

    We asked four people with different levels of computer vision knowledge to each collect 250 faces representative of typical real-world images, with the clear goal of challenging computer vision methods. The result is 1,007 images of faces obtained from a variety of sources.

    @@ -56,25 +107,15 @@ To increase the number of training images, and since COFW has the exact same la

    Biometric Trade Routes

    - +

    - To help understand how COFW Dataset has been used around the world for commercial, military and academic research; publicly available research citing Caltech Occluded Faces in the Wild is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how COFW Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Caltech Occluded Faces in the Wild was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the location markers to reveal research projects at that location.

    -
    @@ -82,23 +123,16 @@ To increase the number of training images, and since COFW has the exact same la
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • -
    - -
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    +
    -

    Supplementary Information

    +

    Supplementary Information

    @@ -129,11 +163,10 @@ To increase the number of training images, and since COFW has the exact same la
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/duke_mtmc/index.html b/site/public/datasets/duke_mtmc/index.html index 37de48ad..62e5d836 100644 --- a/site/public/datasets/duke_mtmc/index.html +++ b/site/public/datasets/duke_mtmc/index.html @@ -17,7 +17,7 @@
    MegaPixels
    -
    Duke MTMC
    +
    Duke MTMC Dataset
    Website
    -
    Created
    2014
    Identities
    Over 2,700
    Used for
    Face recognition, person re-identification
    Created by
    Computer Science Department, Duke University, Durham, US
    Website

    Duke Multi-Target, Multi-Camera Tracking Dataset (Duke MTMC)

    -

    [ PAGE UNDER DEVELOPMENT ]

    -

    Duke MTMC is a dataset of video recorded on Duke University campus during for the purpose of training, evaluating, and improving multi-target multi-camera tracking. The videos were recorded during February and March 2014 and cinclude

    -

    Includes a total of 888.8 minutes of video (ind. verified)

    -

    "We make available a new data set that has more than 2 million frames and more than 2,700 identities. It consists of 8×85 minutes of 1080p video recorded at 60 frames per second from 8 static cameras deployed on the Duke University campus during periods between lectures, when pedestrian traffic is heavy."

    -

    The dataset includes approximately 2,000 annotated identities appearing in 85 hours of video from 8 cameras located throughout Duke University's campus.

    -
     Duke MTMC pixel-averaged image of camera #5 is shown with the bounding boxes for each student drawn in white. (c) Adam Harvey
    Duke MTMC pixel-averaged image of camera #5 is shown with the bounding boxes for each student drawn in white. (c) Adam Harvey

    According to the dataset authors,

    +

    Duke MTMC

    +

    [ page under development ]

    +

    The Duke Multi-Target, Multi-Camera Tracking Dataset (MTMC) is a dataset of video recorded on Duke University campus during for the purpose of training, evaluating, and improving multi-target multi-camera tracking for surveillance. The dataset includes over 14 hours of 1080p video from 8 cameras positioned around Duke's campus during February and March 2014. Over 2,700 unique people are included in the dataset, which has become of the most widely used person re-identification image datasets.

    +

    The 8 cameras deployed on Duke's campus were specifically setup to capture students "during periods between lectures, when pedestrian traffic is heavy".

    +

    Who used Duke MTMC Dataset?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how Duke MTMC Dataset has been used around the world for commercial, military and academic research; publicly available research citing Duke Multi-Target, Multi-Camera Tracking Project is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how Duke MTMC Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Duke Multi-Target, Multi-Camera Tracking Project was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -81,30 +87,19 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    -

    Who used Duke MTMC Dataset?

    +
    + +

    Dataset Citations

    - This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms.

    - -
    -
    - -
    -
    -
    +
    @@ -112,93 +107,29 @@
    -

    Supplementary Information

    +

    Supplementary Information

    -
    - -

    Dataset Citations

    -

    - The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. -

    - -
    -

    Research Notes

    +

    Data Visualizations

    +
     Duke MTMC pixel-averaged image of camera #5 is shown with the bounding boxes for each student drawn in white. (c) Adam Harvey
    Duke MTMC pixel-averaged image of camera #5 is shown with the bounding boxes for each student drawn in white. (c) Adam Harvey

    TODO

      -
    • "We make available a new data set that has more than 2 million frames and more than 2,700 identities. It consists of 8×85 minutes of 1080p video recorded at 60 frames per second from 8 static cameras deployed on the Duke University campus during periods between lectures, when pedestrian traffic is heavy." - 27a2fad58dd8727e280f97036e0d2bc55ef5424c
    • -
    • "This work was supported in part by the EPSRC Programme Grant (FACER2VM) EP/N007743/1, EPSRC/dstl/MURI project EP/R018456/1, the National Natural Science Foundation of China (61373055, 61672265, 61602390, 61532009, 61571313), Chinese Ministry of Education (Z2015101), Science and Technology Department of Sichuan Province (2017RZ0009 and 2017FZ0029), Education Department of Sichuan Province (15ZB0130), the Open Research Fund from Province Key Laboratory of Xihua University (szjj2015-056) and the NVIDIA GPU Grant Program." - ec9c20ed6cce15e9b63ac96bb5a6d55e69661e0b
    • -
    • "DukeMTMC aims to accelerate advances in multi-target multi-camera tracking. It provides a tracking system that works within and across cameras, a new large scale HD video data set recorded by 8 synchronized cameras with more than 7,000 single camera trajectories and over 2,000 unique identities, and a new performance evaluation method that measures how often a system is correct about who is where"
    • -
    • DukeMTMC is a new, manually annotated, calibrated, multi-camera data set recorded outdoors on the Duke University campus with 8 synchronized cameras. It consists of:

      -

      8 static cameras x 85 minutes of 1080p 60 fps video - More than 2,000,000 manually annotated frames - More than 2,000 identities - Manual annotation by 5 people over 1 year - More identities than all existing MTMC datasets combined - Unconstrained paths, diverse appearance

      -
    • -
    • DukeMTMC Project -Ergys Ristani Ergys Ristani Ergys Ristani Ergys Ristani Ergys Ristani
    • +
    • change to heatmap overlay of each location
    • +
    • make fancy viz of foot trails with bbox and blurred persons
    • +
    • expand story
    • +
    • add google street view images of each camera location?
    • +
    • add actual head detections to header image with faces blurred
    • +
    • add 4 diverse example images with faces blurred
    • +
    • add map location of the brainwash cafe
    -

    People involved: -Ergys Ristani, Francesco Solera, Roger S. Zou, Rita Cucchiara, Carlo Tomasi.

    -

    Navigation:

    -

    Data Set - Downloads - Downloads - Dataset Extensions - Performance Measures - Tracking Systems - Publications - How to Cite - Contact

    -

    Welcome to the Duke Multi-Target, Multi-Camera Tracking Project.

    -

    DukeMTMC aims to accelerate advances in multi-target multi-camera tracking. It provides a tracking system that works within and across cameras, a new large scale HD video data set recorded by 8 synchronized cameras with more than 7,000 single camera trajectories and over 2,000 unique identities, and a new performance evaluation method that measures how often a system is correct about who is where. -DukeMTMC Data Set -Snapshot from the DukeMTMC data set.

    -

    DukeMTMC is a new, manually annotated, calibrated, multi-camera data set recorded outdoors on the Duke University campus with 8 synchronized cameras. It consists of:

    -

    8 static cameras x 85 minutes of 1080p 60 fps video - More than 2,000,000 manually annotated frames - More than 2,000 identities - Manual annotation by 5 people over 1 year - More identities than all existing MTMC datasets combined - Unconstrained paths, diverse appearance

    -

    News

    -

    05 Feb 2019 We are organizing the 2nd Workshop on MTMCT and ReID at CVPR 2019 - 25 Jul 2018: The code for DeepCC is available on github - 28 Feb 2018: OpenPose detections now available for download - 19 Feb 2018: Our DeepCC tracker has been accepted to CVPR 2018 - 04 Oct 2017: A new blog post describes ID measures of performance - 26 Jul 2017: Slides from the BMTT 2017 workshop are now available - 09 Dec 2016: DukeMTMC is now hosted on MOTChallenge

    -

    DukeMTMC Downloads

    -

    DukeMTMC dataset (tracking)

    -

    Dataset Extensions

    -

    Below is a list of dataset extensions provided by the community:

    -

    DukeMTMC-VideoReID (download) - DukeMTMC-reID (download) - DukeMTMC4REID - DukeMTMC-attribute

    -

    If you use or extend DukeMTMC, please refer to the license terms. -DukeMTMCT Benchmark

    -

    DukeMTMCT is a tracking benchmark hosted on motchallenge.net. Click here for the up-to-date rankings. Here you will find the official motchallenge-devkit used for evaluation by MOTChallenge. For detailed instructions how to submit on motchallenge you can refer to this link.

    -

    Trackers are ranked using our identity-based measures which compute how often the system is correct about who is where, regardless of how often a target is lost and reacquired. Our measures are useful in applications such as security, surveillance or sports. This short post describes our measures with illustrations, while for details you can refer to the original paper. -Tracking Systems

    -

    We provide code for the following tracking systems which are all based on Correlation Clustering optimization:

    -

    DeepCC for single- and multi-camera tracking [1] - Single-Camera Tracker (demo video) [2] - Multi-Camera Tracker (demo video, failure cases) [2] - People-Groups Tracker [3] - Original Single-Camera Tracker [4]

    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/facebook/index.html b/site/public/datasets/facebook/index.html index b2943e1f..be413510 100644 --- a/site/public/datasets/facebook/index.html +++ b/site/public/datasets/facebook/index.html @@ -38,11 +38,10 @@
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/feret/index.html b/site/public/datasets/feret/index.html index 45510f64..5cd29c4c 100644 --- a/site/public/datasets/feret/index.html +++ b/site/public/datasets/feret/index.html @@ -26,13 +26,34 @@
    -

    FERET

    -
    Years
    1993-1996
    Images
    14,126
    Identities
    1,199
    Origin
    Fairfax, MD

    Facial Recognition Evaluation (FERET) is develop, test, and evaluate face recognition algorithms

    -

    The goal of the FERET program was to develop automatic face recognition capabilities that could be employed to assist security, intelligence, and law enforcement personnel in the performance of their duties.

    +

    Funding

    The FERET program is sponsored by the U.S. Depart- ment of Defense’s Counterdrug Technology Development Program Office. The U.S. Army Research Laboratory (ARL) is the technical agent for the FERET program. ARL designed, administered, and scored the FERET tests. George Mason University collected, processed, and main- tained the FERET database. Inquiries regarding the FERET database or test should be directed to P. Jonathon Phillips.

    @@ -50,11 +71,10 @@

    HRT Transgender Dataset

    -
    -

    Who used HRT Transgender?

    - -

    - This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. -

    - -
    - -
    - -
    -
    -
    -
    - -

    Biometric Trade Routes

    - -

    - To help understand how HRT Transgender has been used around the world for commercial, military and academic research; publicly available research citing HRT Transgender Dataset is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. -

    - -
    - -
    -
    - -
    - -
    -
      -
    • Academic
    • -
    • Commercial
    • -
    • Military / Government
    • -
    • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    • -
    -
    - -
    - -
    -
    -
    -
    - -

    Supplementary Information

    - -
    - -

    Dataset Citations

    -

    - The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. -

    - -
    +

    HRT Transgender Dataset

    +

    [ page under development ]

    +

    {% include 'dashboard.html' }

    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/index.html b/site/public/datasets/index.html index f4776f6a..98db503e 100644 --- a/site/public/datasets/index.html +++ b/site/public/datasets/index.html @@ -61,18 +61,6 @@
    - -
    - Labeled Faces in The Wild -
    -
    2007
    -
    face recognition
    -
    13,233 images
    -
    5,749
    -
    -
    -
    -
    Market-1501 @@ -97,14 +85,14 @@
    - +
    - People in Photo Albums + Oxford Town Centre
    -
    2015
    -
    Face recognition
    -
    37,107 images
    -
    2,356
    +
    2011
    +
    Person detection, gaze estimation
    +
    images
    +
    @@ -141,11 +129,10 @@
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/lfpw/index.html b/site/public/datasets/lfpw/index.html index 77189ce7..005b7aaa 100644 --- a/site/public/datasets/lfpw/index.html +++ b/site/public/datasets/lfpw/index.html @@ -26,8 +26,68 @@
    -

    Labeled Face Parts in The Wild

    -
    Year
    2011
    Images
    1,432
    Origin
    Flickr
    Funding
    CIA

    RESEARCH below this line

    +

    Labeled Face Parts in The Wild

    +
    +

    Who used LFWP?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +
    + +

    Biometric Trade Routes

    + +

    + To help understand how LFWP has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Labeled Face Parts in the Wild was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location. +

    + +
    + +
    +
    +
    + +
    +
      +
    • Academic
    • +
    • Commercial
    • +
    • Military / Government
    • +
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    +
    + + +
    + +

    Dataset Citations

    +

    + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. +

    + +
    +

    RESEARCH below this line

    Release 1 of LFPW consists of 1,432 faces from images downloaded from the web using simple text queries on sites such as google.com, flickr.com, and yahoo.com. Each image was labeled by three MTurk workers, and 29 fiducial points, shown below, are included in dataset. LFPW was originally described in the following publication:

    Due to copyright issues, we cannot distribute image files in any format to anyone. Instead, we have made available a list of image URLs where you can download the images yourself. We realize that this makes it impossible to exactly compare numbers, as image links will slowly disappear over time, but we have no other option. This seems to be the way other large web-based databases seem to be evolving.

    @@ -40,11 +100,10 @@
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html index d451d0cd..cb487913 100644 --- a/site/public/datasets/lfw/index.html +++ b/site/public/datasets/lfw/index.html @@ -42,45 +42,45 @@
    Website
    -
    Created
    2002 – 2004
    Images
    13,233
    Identities
    5,749
    Origin
    Yahoo! News Images
    Used by
    Facebook, Google, Microsoft, Baidu, Tencent, SenseTime, Face++, CIA, NSA, IARPA
    Website
      -
    • There are about 3 men for every 1 woman in the LFW dataset 1
    • -
    • The person with the most images is George W. Bush with 530
    • -
    • There are about 3 George W. Bush's for every 1 Tony Blair
    • -
    • The LFW dataset includes over 500 actors, 30 models, 10 presidents, 124 basketball players, 24 football players, 11 kings, 7 queens, and 1 Moby
    • -
    • In all 3 of the LFW publications [^lfw_original_paper], [^lfw_survey], [^lfw_tech_report] the words "ethics", "consent", and "privacy" appear 0 times
    • -
    • The word "future" appears 71 times
    • -
    • * denotes partial funding for related research
    • -
    -

    Labeled Faces in the Wild

    -

    (PAGE UNDER DEVELOPMENT)

    -

    Labeled Faces in The Wild (LFW) is "a database of face photographs designed for studying the problem of unconstrained face recognition 1. It is used to evaluate and improve the performance of facial recognition algorithms in academic, commercial, and government research. According to BiometricUpdate.com 3, LFW is "the most widely used evaluation set in the field of facial recognition, LFW attracts a few dozen teams from around the globe including Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong."

    +

    Labeled Faces in the Wild

    +

    [ PAGE UNDER DEVELOPMENT ]

    +

    Labeled Faces in The Wild (LFW) is "a database of face photographs designed for studying the problem of unconstrained face recognition 1. It is used to evaluate and improve the performance of facial recognition algorithms in academic, commercial, and government research. According to BiometricUpdate.com 3, LFW is "the most widely used evaluation set in the field of facial recognition, LFW attracts a few dozen teams from around the globe including Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong."

    The LFW dataset includes 13,233 images of 5,749 people that were collected between 2002-2004. LFW is a subset of Names of Faces and is part of the first facial recognition training dataset created entirely from images appearing on the Internet. The people appearing in LFW are...

    The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

    The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

    All 5,379 people in the Labeled Faces in The Wild Dataset. Showing one face per person
    All 5,379 people in the Labeled Faces in The Wild Dataset. Showing one face per person

    The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

    The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

    +

    Who used LFW?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how LFW has been used around the world for commercial, military and academic research; publicly available research citing Labeled Faces in the Wild is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how LFW has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Labeled Faces in the Wild was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -88,30 +88,19 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    -

    Who used LFW?

    +
    + +

    Dataset Citations

    - This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms.

    - -
    -
    - -
    -
    -
    +
    @@ -119,33 +108,52 @@
    -

    Supplementary Information

    +

    Supplementary Information

    -
    - -

    Dataset Citations

    -

    - The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. -

    - -

    Commercial Use

    Add a paragraph about how usage extends far beyond academia into research centers for largest companies in the world. And even funnels into CIA funded research in the US and defense industry usage in China.

    -

    Research, text, and graphics ©Adam Harvey / megapixels.cc

    +

    Research

    +
      +
    • "In our experiments, we used 10000 images and associated captions from the Faces in the wilddata set [3]."
    • +
    • "This work was supported in part by the Center for Intelligent Information Retrieval, the Central Intelligence Agency, the National Security Agency and National Science Foundation under CAREER award IIS-0546666 and grant IIS-0326249."
    • +
    • From: "People-LDA: Anchoring Topics to People using Face Recognition" https://www.semanticscholar.org/paper/People-LDA%3A-Anchoring-Topics-to-People-using-Face-Jain-Learned-Miller/10f17534dba06af1ddab96c4188a9c98a020a459 and https://ieeexplore.ieee.org/document/4409055
    • +
    • This paper was presented at IEEE 11th ICCV conference Oct 14-21 and the main LFW paper "Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments" was also published that same year
    • +
    • 10f17534dba06af1ddab96c4188a9c98a020a459
    • +
    • This research is based upon work supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via contract number 2014-14071600010.
    • +
    • From "Labeled Faces in the Wild: Updates and New Reporting Procedures"
    • +
    • 70% of people in the dataset have only 1 image and 29% have 2 or more images
    • +
    • The LFW dataset is considered the "most popular benchmark for face recognition" 2
    • +
    • The LFW dataset is "the most widely used evaluation set in the field of facial recognition" 3
    • +
    • All images in LFW dataset were obtained "in the wild" meaning without any consent from the subject or from the photographer
    • +
    • The faces in the LFW dataset were detected using the Viola-Jones haarcascade face detector [^lfw_website] [^lfw-survey]
    • +
    • The LFW dataset is used by several of the largest tech companies in the world including "Google, Facebook, Microsoft Research Asia, Baidu, Tencent, SenseTime, Face++ and Chinese University of Hong Kong." 3
    • +
    • All images in the LFW dataset were copied from Yahoo News between 2002 - 2004
    • +
    • In 2014, two of the four original authors of the LFW dataset received funding from IARPA and ODNI for their followup paper Labeled Faces in the Wild: Updates and New Reporting Procedures via IARPA contract number 2014-14071600010
    • +
    • The dataset includes 2 images of George Tenet, the former Director of Central Intelligence (DCI) for the Central Intelligence Agency whose facial biometrics were eventually used to help train facial recognition software in China and Russia
    • +
    • ./15/155205b8e288fd49bf203135871d66de879c8c04/paper.txt shows usage by DSTO Australia, supported parimal@iisc.ac.in
    • +
    +
    Created
    2002 – 2004
    Images
    13,233
    Identities
    5,749
    Origin
    Yahoo! News Images
    Used by
    Facebook, Google, Microsoft, Baidu, Tencent, SenseTime, Face++, CIA, NSA, IARPA
    Website
      +
    • There are about 3 men for every 1 woman in the LFW dataset 1
    • +
    • The person with the most images is George W. Bush with 530
    • +
    • There are about 3 George W. Bush's for every 1 Tony Blair
    • +
    • The LFW dataset includes over 500 actors, 30 models, 10 presidents, 124 basketball players, 24 football players, 11 kings, 7 queens, and 1 Moby
    • +
    • In all 3 of the LFW publications [^lfw_original_paper], [^lfw_survey], [^lfw_tech_report] the words "ethics", "consent", and "privacy" appear 0 times
    • +
    • The word "future" appears 71 times
    • +
    • * denotes partial funding for related research
    • +
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/market_1501/index.html b/site/public/datasets/market_1501/index.html index 3281a9ae..059b1a49 100644 --- a/site/public/datasets/market_1501/index.html +++ b/site/public/datasets/market_1501/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Market-1501 is a dataset is collection of CCTV footage from ...
    The Market-1501 dataset includes ... +
    Market-1501 is a dataset is collection of CCTV footage from Tsinghua University
    The Market-1501 dataset includes 1,261 people from 5 HD surveillance cameras located on campus

    Market-1501 ...

    -

    (PAGE UNDER DEVELOPMENT)

    +

    Market-1501 Dataset

    +

    [ PAGE UNDER DEVELOPMENT]

    +

    Who used Market 1501?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how Market 1501 has been used around the world for commercial, military and academic research; publicly available research citing Market 1501 Dataset is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how Market 1501 has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Market 1501 Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -73,40 +82,12 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    -

    Who used Market 1501?

    - -

    - This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. -

    - -
    - -
    -
    -
    -
    - -
    -
    -
    -
    - -

    Supplementary Information

    - -

    Dataset Citations

    @@ -114,7 +95,7 @@

    -

    Research Notes

    +

    (ignore) research Notes

    • "MARS is an extension of the Market-1501 dataset. During collection, we placed six near synchronized cameras in the campus of Tsinghua university. There were Five 1,0801920 HD cameras and one 640480 SD camera. MARS consists of 1,261 different pedestrians whom are captured by at least 2 cameras. Given a query tracklet, MARS aims to retrieve tracklets that contain the same ID." - main paper
    • bbox "0065C1T0002F0016.jpg", "0065" is the ID of the pedestrian. "C1" denotes the first @@ -135,11 +116,10 @@ organization={Springer}

    Microsoft Celeb Dataset (MS Celeb)

    -

    (PAGE UNDER DEVELOPMENT)

    -

    At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio.

    -

    Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat

    +

    Microsoft Celeb Dataset (MS Celeb)

    +

    [ PAGE UNDER DEVELOPMENT ]

    Who used MsCeleb?

    @@ -65,30 +63,24 @@
    -
    +
    + +
    -
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how MsCeleb has been used around the world for commercial, military and academic research; publicly available research citing Microsoft Celebrity Dataset is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how MsCeleb has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Microsoft Celebrity Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -96,26 +88,12 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -

    Add more analysis here

    -
    -
    -
    -
    -
    - -

    Supplementary Information

    - -
    +

    Dataset Citations

    @@ -123,6 +101,15 @@

    +
    + +
    +
    +
    +
    + +

    Supplementary Information

    +

    Additional Information

    • The dataset author spoke about his research at the CVPR conference in 2016 https://www.youtube.com/watch?v=Nl2fBKxwusQ
    • @@ -136,11 +123,10 @@
      MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/oxford_town_centre/index.html b/site/public/datasets/oxford_town_centre/index.html new file mode 100644 index 00000000..db62a5a6 --- /dev/null +++ b/site/public/datasets/oxford_town_centre/index.html @@ -0,0 +1,150 @@ + + + + MegaPixels + + + + + + + + + + + +
      + + +
      MegaPixels
      +
      TownCentre
      +
      + +
      +
      + +
      Oxford Town Centre is a dataset of surveillance camera footage from Cornmarket St Oxford, England
      The Oxford Town Centre dataset includes +

      Oxford Town Centre

      +

      [ page under development ]

      +

      The Oxford Town Centre dataset is a video of pedestrians in a busy downtown area in Oxford used for creating surveillance algorithms with "potential applications in activity recognition and remote biometric analysis" or non-cooperative face recognition. 1

      +

      Based on observations of the dataset video and Google Street images, the source of the footage has been geolocated to a public CCTV camera at the intersection of Cornmarket and Market St. Oxford, England (map). Based on an analysis of the papers that use or cite this dataset 2 the inferred year of capture was definitely 2009 and the season was perhaps February or March based on the the window advertisements and cool-weather clothing.

      +

      Halfway through the video a peculiar and somewhat rude man enters the video and stands directly over top a water drain for over a minute. His unusual demeanor and apparently scripted behavior suggests a possible relationship to the CCTV operators.

      +

      Although Oxford Town Centre dataset first appears as a pedestrian dataset, it was created to improve the stabilization of pedstrian detections in order to extract a more accurate head region that would lead to improvements in face recognition.

      +
       Footage from this public CCTV camera was used to create the Oxford Town Centre dataset. Image source Google Sreet View
      Footage from this public CCTV camera was used to create the Oxford Town Centre dataset. Image source Google Sreet View
      +

      Who used TownCentre?

      + +

      + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

      + +
      + +
      + +
      +
      + +
      +
      +
      + +
      + +

      Biometric Trade Routes

      + +

      + To help understand how TownCentre has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Oxford Town Centre was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location. +

      + +
      + +
      +
      +
      + +
      +
        +
      • Academic
      • +
      • Commercial
      • +
      • Military / Government
      • +
      +
      Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
      +
      + + +
      + +

      Dataset Citations

      +

      + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. +

      + +
      +
      + +
      +
      +
      +
      + +

      Supplementary Information

      + +

      Several researchers have posted their demo videos using the Oxford Town Centre dataset on YouTube:

      + +

      [ add visualization ]

      +

      TODO

      +
        +
      • make visualization
      • +
      • add license info
      • +
      +
      • a

        Benfold, Ben and Reid, Ian. "Stable Multi-Target Tracking in Real-Time Surveillance Video". CVPR 2011. Pages 3457-3464.

        +
      • a

        "Guiding Visual Surveillance by Tracking Human Attention". 2009.

        +
      + +
      + + + + + \ No newline at end of file diff --git a/site/public/datasets/pipa/index.html b/site/public/datasets/pipa/index.html index 27168c5c..7a4fbc0e 100644 --- a/site/public/datasets/pipa/index.html +++ b/site/public/datasets/pipa/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -17,7 +17,7 @@
      MegaPixels
      -
      PIPA
      +
      PIPA Dataset
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    - -
    -
    -
    -
    -

    Supplementary Information

    - -
    +

    Dataset Citations

    @@ -102,18 +98,16 @@

    -

    Research Notes

    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/pubfig/index.html b/site/public/datasets/pubfig/index.html new file mode 100644 index 00000000..c46eeea3 --- /dev/null +++ b/site/public/datasets/pubfig/index.html @@ -0,0 +1,117 @@ + + + + MegaPixels + + + + + + + + + + + +
    + + +
    MegaPixels
    +
    PubFig
    +
    + +
    +
    + +
    PubFig is a dataset...
    [ add subdescrition ] +

    PubFig

    +

    [ PAGE UNDER DEVELOPMENT ]

    +
    +

    Who used PubFig?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +
    + +

    Biometric Trade Routes

    + +

    + To help understand how PubFig has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Public Figures Face Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location. +

    + +
    + +
    +
    +
    + +
    +
      +
    • Academic
    • +
    • Commercial
    • +
    • Military / Government
    • +
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    +
    + + +
    + +

    Dataset Citations

    +

    + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. +

    + +
    +
    + +
    + + + + + \ No newline at end of file diff --git a/site/public/datasets/uccs/index.html b/site/public/datasets/uccs/index.html index 593ac498..5bb120ba 100644 --- a/site/public/datasets/uccs/index.html +++ b/site/public/datasets/uccs/index.html @@ -4,7 +4,7 @@ MegaPixels - + @@ -26,7 +26,7 @@
    -
    Unconstrained College Students (UCCS) is a dataset of long-range surveillance photos of students taken without their knowledge
    The UCCS dataset includes 16,149 images and 1,732 identities of students at University of Colorado Colorado Springs campus and is used for face recognition and face detection +
    UnConstrained College Students is a dataset of long-range surveillance photos of students at University of Colorado in Colorado Springs
    The UnConstrained College Students dataset includes 16,149 images and 1,732 identities of subjects on University of Colorado Colorado Springs campus and is used for making face recognition and face detection algorithms

    Unconstrained College Students ...

    -

    (PAGE UNDER DEVELOPMENT)

    -

    Unconstrained College Students (UCCS) is a dataset of long-range surveillance photos captured at University of Colorado Colorado Springs. According to the authors of two papers associated with the dataset, subjects were "photographed using a long-range high-resolution surveillance camera without their knowledge" [^funding_sb]. The images were captured using a Canon 7D digital camera fitted with a Sigma 800mm telephoto lens pointed out the window of an office.

    -

    The UCCS dataset was funded by ODNI (Office of Director of National Intelligence), IARPA (Intelligence Advance Research Projects Activity), ONR MURI Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative, Army SBIR (Small Business Innovation Research), SOCOM SBIR (Special Operations Command and Small Business Innovation Research), and the National Science Foundation.

    -

    The images in UCCS include students walking between classes on campus over 19 days in 2012 - 2013. The dates include:

    +

    UnConstrained College Students

    +

    [ page under development ]

    +

    UnConstrained College Students (UCCS) is a dataset of long-range surveillance photos captured at University of Colorado Colorado Springs. According to the authors of two papers associated with the dataset, subjects were "photographed using a long-range high-resolution surveillance camera without their knowledge" 2. To create the dataset, the researchers used a Canon 7D digital camera fitted with a Sigma 800mm telephoto lens and photographed students 150–200m away through their office window. Photos were taken during the morning and afternoon while students were walking to and from classes. The primary uses of this dataset are to train, validate, and build recognition and face detection algorithms for realistic surveillance scenarios.

    +

    What makes the UCCS dataset unique is that it includes the highest resolution images of any publicly available face recognition dataset discovered so far (18MP), that it was captured on a campus without consent or awareness using a long-range telephoto lens, and that it was funded by United States defense and intelligence agencies.

    +

    Combined funding sources for the creation of the initial and final release of the dataset include ODNI (Office of Director of National Intelligence), IARPA (Intelligence Advance Research Projects Activity), ONR MURI (Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative), Army SBIR (Small Business Innovation Research), SOCOM SBIR (Special Operations Command and Small Business Innovation Research), and the National Science Foundation. 1 2

    +

    In 2017 the UCCS face dataset was used for a defense and intelligence agency funded face recognition challenge at the International Joint Biometrics Conference in Denver, CO. And in 2018 the dataset was used for the 2nd Unconstrained Face Detection and Open Set Recognition Challenge at the European Computer Vision Conference (ECCV) in Munich, Germany. Additional research projects that have used the UCCS dataset are included below in the list of verified citations.

    +
    +

    Who used UCCS?

    + +

    + This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. +

    + +
    + +
    + +
    +
    + +
    +
    +
    + +
    + +

    Biometric Trade Routes

    + +

    + To help understand how UCCS has been used around the world by commercial, military, and academic organizations; existing publicly available research citing UnConstrained College Students Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location. +

    + +
    + +
    +
    +
    + +
    +
      +
    • Academic
    • +
    • Commercial
    • +
    • Military / Government
    • +
    +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    +
    + + +
    + +

    Dataset Citations

    +

    + The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. +

    + +
    +
    + +
    +
    +
    +
    + +

    Supplementary Information

    + +

    Dates and Times

    +

    The images in UCCS were taken on 18 non-consecutive days during 2012–2013. Analysis of the EXIF data embedded in original images reveal that most of the images were taken on Tuesdays, and the most frequent capture time throughout the week was 12:30PM.

    +
     UCCS photos captured per weekday © megapixels.cc
    UCCS photos captured per weekday © megapixels.cc
     UCCS photos captured per 10-minute intervals per weekday © megapixels.cc
    UCCS photos captured per 10-minute intervals per weekday © megapixels.cc

    UCCS photos taken in 2012

    - - - - - - - - - + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - - + +
    YearMonthDay DateTime Range Photos
    2012Februay---23-Feb 23, 2012 132
    2012March---6--March 6, 2012288
    2012March---8--March 8, 2012506
    2012March---13--March 13, 2012160
    2012Februay---23-132March 20, 20121,840
    2012March---6--March 22, 2012445
    2012March---8--April 3, 20121,639
    2012March---13--April 12, 201214
    2012Februay---23-132April 17, 201219
    2012March---6--April 24, 201263
    2012March---8--April 25, 201211
    2012March---13--April 26, 201220
    2012Februay---23-132
    +

    UCCS photos taken in 2013

    + + + + + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + + - - - - - - + +
    DatePhotos
    2012March---6--Jan 28, 20131,056
    2012March---8--Jan 29, 20131,561
    2012March---13--Feb 13, 2013739
    2012Februay---23-132Feb 19, 2013723
    2012March---6--Feb 20, 2013965
    2012March---8--Feb 26, 2013736
    -

    2012-03-20 -2012-03-22 -2012-04-03 -2012-04-12 -2012-04-17 -2012-04-24 -2012-04-25 -2012-04-26 -2013-01-28 -2013-01-29 -2013-02-13 -2013-02-19 -2013-02-20 -2013-02-26

    -
     The pixel-average of all Uconstrained College Students images is shown with all 51,838 face annotations. (c) Adam Harvey
    The pixel-average of all Uconstrained College Students images is shown with all 51,838 face annotations. (c) Adam Harvey
    - -

    Biometric Trade Routes

    - -

    - To help understand how UCCS has been used around the world for commercial, military and academic research; publicly available research citing UnConstrained College Students Dataset is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. -

    - -
    - -
    -
    - -
    - -
    -
      -
    • Academic
    • -
    • Commercial
    • -
    • Military / Government
    • -
    • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    • -
    -
    - -
    -

    Who used UCCS?

    - -

    - This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries. -

    - -
    - -
    - -
    -
    -
    -
    - -

    Dataset Citations

    -

    - The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to train or test machine learning algorithms. -

    - -
    -
    - -
    -
    -
    -
    - -

    Supplementary Information

    - -

    The original Sapkota and Boult dataset, from which UCCS is derived, received funding from1:

    +

    Location

    +

    The location of the camera and subjects can confirmed using the Bellingcat method. The visual clues that lead to location of the camera and subjects include the unique pattern of the sidewalk that is only used on the UCCS Pedestrian Spine near the West Lawn, the two UCCS sign poles with matching graphics still visible in Google Street View, the no parking sign and directionality of its arrow, the back of street sign next to it, the slight bend in the sidewalk, the presence of cars passing in the background of the image, and the far wall of the parking garage all match images in the dataset. The original papers also provides another clue: a picture of the camera inside the office that was used to create the dataset. The window view in this image provides another match for the brick pattern on the north facade of the Kraember Family Library and the green metal fence along the sidewalk. View the location on Google Maps

    +
     Location on campus where students were unknowingly photographed with a telephoto lens to be used for defense and intelligence agency funded research on face recognition. Image: Google Maps
    Location on campus where students were unknowingly photographed with a telephoto lens to be used for defense and intelligence agency funded research on face recognition. Image: Google Maps
     3D view showing the angle of view of the surveillance camera used for UCCS dataset. Image: Google Maps
    3D view showing the angle of view of the surveillance camera used for UCCS dataset. Image: Google Maps

    Funding

    +

    The UnConstrained College Students dataset is associated with two main research papers: "Large Scale Unconstrained Open Set Face Database" and "Unconstrained Face Detection and Open-Set Face Recognition Challenge". Collectively, these papers and the creation of the dataset have received funding from the following organizations:

    • ONR (Office of Naval Research) MURI (The Department of Defense Multidisciplinary University Research Initiative) grant N00014-08-1-0638
    • Army SBIR (Small Business Innovation Research) grant W15P7T-12-C-A210
    • SOCOM (Special Operations Command) SBIR (Small Business Innovation Research) grant H92222-07-P-0020
    • -
    -

    The more recent UCCS version of the dataset received funding from 2:

    -
    • National Science Foundation Grant IIS-1320956
    • ODNI (Office of Director of National Intelligence)
    • IARPA (Intelligence Advance Research Projects Activity) R&D contract 2014-14071600012
    -

    TODO

    +

    Opting Out

    +

    If you attended University of Colorado Colorado Springs and were captured by the long range surveillance camera used to create this dataset, there is unfortunately currently no way to be removed. The authors do not provide any options for students to opt-out nor were students informed they would be used for training face recognition. According to the authors, the lack of any consent or knowledge of participation is what provides part of the value of Unconstrained College Students Dataset.

    +

    Ethics

    +

    Please direct any questions about the ethics of the dataset to the University of Colorado Colorado Springs Ethics and Compliance Office

    +

    Technical Details

    +

    For further technical information about the dataset, visit the UCCS dataset project page.

    +

    Under Development

      -
    • add tabulator module for dates
    • -
    • parse dates into CSV using Python
    • -
    • get google image showing line of sight?
    • -
    • fix up quote/citations
    • +
    • adding more verified locations to map and charts
    • +
    • add EXIF file to CDN
    -

    footnotes

    -
    -
    -
    1. Sapkota, Archana and Boult, Terrance. "Large Scale Unconstrained Open Set Face Database." 2013.

    2. -
    3. Günther, M. et. al. "Unconstrained Face Detection and Open-Set Face Recognition Challenge," 2018. Arxiv 1708.02337v3.

    4. -
    -
    -
    +
    • a

      Sapkota, Archana and Boult, Terrance. "Large Scale Unconstrained Open Set Face Database." 2013.

      +
    • ab

      Günther, M. et. al. "Unconstrained Face Detection and Open-Set Face Recognition Challenge," 2018. Arxiv 1708.02337v3.

      +
    MegaPixels ©2017-19 Adam R. Harvey /  diff --git a/site/public/datasets/vgg_face2/index.html b/site/public/datasets/vgg_face2/index.html index d5c1d98c..321fb203 100644 --- a/site/public/datasets/vgg_face2/index.html +++ b/site/public/datasets/vgg_face2/index.html @@ -17,7 +17,7 @@
    MegaPixels
    - +
    Brainwash Dataset

    VIPeR Dataset

    +

    [ page under development ]

    VIPeR (Viewpoint Invariant Pedestrian Recognition) is a dataset of pedestrian images captured at University of California Santa Cruz in 2007. Accoriding to the reserachers 2 "cameras were placed in different locations in an academic setting and subjects were notified of the presence of cameras, but were not coached or instructed in any way."

    VIPeR is amongst the most widely used publicly available person re-identification datasets. In 2017 the VIPeR dataset was combined into a larger person re-identification created by the Chinese University of Hong Kong called PETA (PEdesTrian Attribute).

    @@ -62,30 +62,24 @@
    -
    +
    + +
    -
    +
    + +

    Biometric Trade Routes

    - +

    - To help understand how VIPeR has been used around the world for commercial, military and academic research; publicly available research citing Viewpoint Invariant Pedestrian Recognition is collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal reserach projects at that location. + To help understand how VIPeR has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Viewpoint Invariant Pedestrian Recognition was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

    -
    @@ -93,25 +87,12 @@
  • Academic
  • Commercial
  • Military / Government
  • -
  • Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
  • +
    Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.
    -
    -
    -
    -
    -
    - -

    Supplementary Information

    - -
    +

    Dataset Citations

    @@ -125,11 +106,10 @@