summaryrefslogtreecommitdiff
path: root/site/content/pages/datasets
diff options
context:
space:
mode:
Diffstat (limited to 'site/content/pages/datasets')
-rw-r--r--site/content/pages/datasets/brainwash/assets/00818000_640x480.jpgbin33112 -> 0 bytes
-rw-r--r--site/content/pages/datasets/brainwash/assets/background_540.jpgbin83594 -> 0 bytes
-rwxr-xr-xsite/content/pages/datasets/brainwash/assets/background_600.jpgbin86425 -> 0 bytes
-rwxr-xr-xsite/content/pages/datasets/brainwash/assets/brainwash_mean_overlay.jpgbin0 -> 150399 bytes
-rwxr-xr-xsite/content/pages/datasets/brainwash/assets/brainwash_mean_overlay_wm.jpgbin0 -> 151713 bytes
-rw-r--r--site/content/pages/datasets/brainwash/index.md44
-rwxr-xr-xsite/content/pages/datasets/duke_mtmc/assets/duke_mtmc_cam5_average_comp.jpgbin0 -> 195172 bytes
-rw-r--r--site/content/pages/datasets/duke_mtmc/index.md28
-rw-r--r--site/content/pages/datasets/index.md2
-rw-r--r--site/content/pages/datasets/msceleb/assets/background.jpgbin0 -> 422970 bytes
-rw-r--r--site/content/pages/datasets/msceleb/assets/index.jpgbin0 -> 39839 bytes
-rw-r--r--site/content/pages/datasets/msceleb/index.md56
-rw-r--r--site/content/pages/datasets/uccs/assets/uccs_bboxes_clr_fill.jpgbin146050 -> 0 bytes
-rw-r--r--site/content/pages/datasets/uccs/assets/uccs_bboxes_grayscale.jpgbin299802 -> 0 bytes
-rw-r--r--site/content/pages/datasets/uccs/assets/uccs_mean_bboxes_comp.jpgbin0 -> 253215 bytes
-rw-r--r--site/content/pages/datasets/uccs/index.md78
16 files changed, 161 insertions, 47 deletions
diff --git a/site/content/pages/datasets/brainwash/assets/00818000_640x480.jpg b/site/content/pages/datasets/brainwash/assets/00818000_640x480.jpg
deleted file mode 100644
index 30c0fcb1..00000000
--- a/site/content/pages/datasets/brainwash/assets/00818000_640x480.jpg
+++ /dev/null
Binary files differ
diff --git a/site/content/pages/datasets/brainwash/assets/background_540.jpg b/site/content/pages/datasets/brainwash/assets/background_540.jpg
deleted file mode 100644
index 5c8c0ad4..00000000
--- a/site/content/pages/datasets/brainwash/assets/background_540.jpg
+++ /dev/null
Binary files differ
diff --git a/site/content/pages/datasets/brainwash/assets/background_600.jpg b/site/content/pages/datasets/brainwash/assets/background_600.jpg
deleted file mode 100755
index 8f2de697..00000000
--- a/site/content/pages/datasets/brainwash/assets/background_600.jpg
+++ /dev/null
Binary files differ
diff --git a/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay.jpg b/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay.jpg
new file mode 100755
index 00000000..2f5917e3
--- /dev/null
+++ b/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay.jpg
Binary files differ
diff --git a/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay_wm.jpg b/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay_wm.jpg
new file mode 100755
index 00000000..790dbb79
--- /dev/null
+++ b/site/content/pages/datasets/brainwash/assets/brainwash_mean_overlay_wm.jpg
Binary files differ
diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md
index 0bf67455..db88d949 100644
--- a/site/content/pages/datasets/brainwash/index.md
+++ b/site/content/pages/datasets/brainwash/index.md
@@ -2,8 +2,8 @@
status: published
title: Brainwash
-desc: Brainwash is a dataset of webcam images taken from the Brainwash Cafe in San Francisco
-subdesc: The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection algorithms
+desc: Brainwash is a dataset of webcam images taken from the Brainwash Cafe in San Francisco in 2014
+subdesc: The Brainwash dataset includes 11,918 images of "everyday life of a busy downtown cafe" and is used for training head detection surveillance algorithms
slug: brainwash
cssclass: dataset
image: assets/background.jpg
@@ -15,32 +15,18 @@ authors: Adam Harvey
------------
### sidebar
-
-+ Published: 2015
-+ Images: 11,918
-+ Faces: 91,146
-+ Created by: Stanford Department of Computer Science
-+ Funded by: Max Planck Center for Visual Computing and Communication
-+ Location: Brainwash Cafe, San Franscisco
-+ Purpose: Training face detection
-+ Website: <a href="https://exhibits.stanford.edu/data/catalog/sx925dc9385">stanford.edu</a>
-+ Paper: <a href="http://arxiv.org/abs/1506.04878">End-to-End People Detection in Crowded Scenes</a>
-+ Explicit Consent: No
-
+### end sidebar
## Brainwash Dataset
-(PAGE UNDER DEVELOPMENT)
+*Brainwash* is a head detection dataset created from San Francisco's Brainwash Cafe livecam footage. It includes 11,918 images of "everyday life of a busy downtown cafe"[^readme] captured at 100 second intervals throught the entire day. Brainwash dataset was captured during 3 days in 2014: October 27, November 13, and November 24. According the author's reserach paper introducing the dataset, the images were acquired with the help of Angelcam.com [cite orig paper].
-*Brainwash* is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe[^readme]". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them.
+Brainwash is not a widely used dataset but since its publication by Stanford University in 2015, it has notably appeared in several research papers from the National University of Defense Technology in Changsha, China. In 2016 and in 2017 researchers there conducted studies on detecting people's heads in crowded scenes for the purpose of surveillance [^localized_region_context] [^replacement_algorithm].
-Before closing in 2017, Brainwash Cafe was a "cafe and laundromat" located in San Francisco's SoMA district. The cafe published a publicy available livestream from the cafe with a view of the cash register, performance stage, and seating area.
+If you happen to have been at Brainwash cafe in San Franscisco at any time on October 26, November 13, or November 24 in 2014 you are most likely included in the Brainwash dataset.
-Since it's publication by Stanford in 2015, the Brainwash dataset has appeared in several notable research papers. In September 2016 four researchers from the National University of Defense Technology in Changsha, China used the Brainwash dataset for a research study on "people head detection in crowded scenes", concluding that their algorithm "achieves superior head detection performance on the crowded scenes dataset[^localized_region_context]". And again in 2017 three researchers at the National University of Defense Technology used Brainwash for a study on object detection noting "the data set used in our experiment is shown in Table 1, which includes one scene of the brainwash dataset[^replacement_algorithm]".
+![caption: The pixel-averaged image of all Brainwash dataset images is shown with 81,973 head annotations drawn from the Brainwash training partition. (c) Adam Harvey](assets/brainwash_mean_overlay.jpg)
-![caption: An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)](assets/00425000_960.jpg)
-
-![caption: 49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)](assets/brainwash_montage.jpg)
{% include 'chart.html' %}
@@ -48,19 +34,27 @@ Since it's publication by Stanford in 2015, the Brainwash dataset has appeared i
{% include 'map.html' %}
-Add more analysis here
-
+{% include 'citations.html' %}
{% include 'supplementary_header.html' %}
-{% include 'citations.html' %}
+![caption: An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)](assets/00425000_960.jpg)
+![caption: 49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)](assets/brainwash_montage.jpg)
-### Additional Information
+#### Additional Resources
- The dataset author spoke about his research at the CVPR conference in 2016 <https://www.youtube.com/watch?v=Nl2fBKxwusQ>
+TODO
+
+- add bounding boxes to the header image
+- remake montage with randomized images, with bboxes
+- clean up intro text
+- verify quote citations
+
+
### Footnotes
[^readme]: "readme.txt" https://exhibits.stanford.edu/data/catalog/sx925dc9385.
diff --git a/site/content/pages/datasets/duke_mtmc/assets/duke_mtmc_cam5_average_comp.jpg b/site/content/pages/datasets/duke_mtmc/assets/duke_mtmc_cam5_average_comp.jpg
new file mode 100755
index 00000000..3cd64df1
--- /dev/null
+++ b/site/content/pages/datasets/duke_mtmc/assets/duke_mtmc_cam5_average_comp.jpg
Binary files differ
diff --git a/site/content/pages/datasets/duke_mtmc/index.md b/site/content/pages/datasets/duke_mtmc/index.md
index de1fa14c..c626ef4e 100644
--- a/site/content/pages/datasets/duke_mtmc/index.md
+++ b/site/content/pages/datasets/duke_mtmc/index.md
@@ -2,8 +2,8 @@
status: published
title: Duke Multi-Target, Multi-Camera Tracking
-desc: <span class="dataset-name">Duke MTMC</span> is a dataset of CCTV footage of students at Duke University
-subdesc: Duke MTMC contains over 2 million video frames and 2,000 unique identities collected from 8 cameras at Duke University campus in March 2014
+desc: <span class="dataset-name">Duke MTMC</span> is a dataset of surveillance camera footage of students on Duke University campus
+subdesc: Duke MTMC contains over 2 million video frames and 2,000 unique identities collected from 8 HD cameras at Duke University campus in March 2014
slug: duke_mtmc
cssclass: dataset
image: assets/background.jpg
@@ -15,17 +15,27 @@ authors: Adam Harvey
### sidebar
-+ Collected: March 19, 2014
-+ Cameras: 8
-+ Video Frames: 2,000,000
-+ Identities: Over 2,000
-+ Used for: Person re-identification, <br>face recognition
-+ Sector: Academic
++ Created: 2014
++ Identities: Over 2,700
++ Used for: Face recognition, person re-identification
++ Created by: Computer Science Department, Duke University, Durham, US
+ Website: <a href="http://vision.cs.duke.edu/DukeMTMC/">duke.edu</a>
## Duke Multi-Target, Multi-Camera Tracking Dataset (Duke MTMC)
-(PAGE UNDER DEVELOPMENT)
+[ PAGE UNDER DEVELOPMENT ]
+
+Duke MTMC is a dataset of video recorded on Duke University campus during for the purpose of training, evaluating, and improving *multi-target multi-camera tracking*. The videos were recorded during February and March 2014 and cinclude
+
+Includes a total of 888.8 minutes of video (ind. verified)
+
+"We make available a new data set that has more than 2 million frames and more than 2,700 identities. It consists of 8×85 minutes of 1080p video recorded at 60 frames per second from 8 static cameras deployed on the Duke University campus during periods between lectures, when pedestrian traffic is heavy."
+
+The dataset includes approximately 2,000 annotated identities appearing in 85 hours of video from 8 cameras located throughout Duke University's campus.
+
+![caption: Duke MTMC pixel-averaged image of camera #5 is shown with the bounding boxes for each student drawn in white. (c) Adam Harvey](assets/duke_mtmc_cam5_average_comp.jpg)
+
+According to the dataset authors,
{% include 'map.html' %}
diff --git a/site/content/pages/datasets/index.md b/site/content/pages/datasets/index.md
index 2e943fbe..c0373d60 100644
--- a/site/content/pages/datasets/index.md
+++ b/site/content/pages/datasets/index.md
@@ -13,4 +13,4 @@ sync: false
# Facial Recognition Datasets
-### Survey
+Explore publicly available facial recognition datasets. More datasets will be added throughout 2019.
diff --git a/site/content/pages/datasets/msceleb/assets/background.jpg b/site/content/pages/datasets/msceleb/assets/background.jpg
new file mode 100644
index 00000000..c1cd486e
--- /dev/null
+++ b/site/content/pages/datasets/msceleb/assets/background.jpg
Binary files differ
diff --git a/site/content/pages/datasets/msceleb/assets/index.jpg b/site/content/pages/datasets/msceleb/assets/index.jpg
new file mode 100644
index 00000000..fb3a934a
--- /dev/null
+++ b/site/content/pages/datasets/msceleb/assets/index.jpg
Binary files differ
diff --git a/site/content/pages/datasets/msceleb/index.md b/site/content/pages/datasets/msceleb/index.md
new file mode 100644
index 00000000..eb084eaa
--- /dev/null
+++ b/site/content/pages/datasets/msceleb/index.md
@@ -0,0 +1,56 @@
+------------
+
+status: published
+title: MS Celeb
+desc: MS Celeb is a dataset of web images used for training and evaluating face recognition algorithms
+subdesc: The MS Celeb dataset includes over 10,000,000 images and 93,000 identities of semi-public figures collected using the Bing search engine
+slug: msceleb
+cssclass: dataset
+image: assets/background.jpg
+year: 2015
+published: 2019-2-23
+updated: 2019-2-23
+authors: Adam Harvey
+
+------------
+
+### sidebar
+
++ Published: TBD
++ Images: TBD
++ Faces: TBD
++ Created by: TBD
+
+
+## Microsoft Celeb Dataset (MS Celeb)
+
+(PAGE UNDER DEVELOPMENT)
+
+At vero eos et accusamus et iusto odio dignissimos ducimus, qui blanditiis praesentium voluptatum deleniti atque corrupti, quos dolores et quas molestias excepturi sint, obcaecati cupiditate non-provident, similique sunt in culpa, qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio.
+
+Nam libero tempore, cum soluta nobis est eligendi optio, cumque nihil impedit, quo minus id, quod maxime placeat, facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Temporibus autem quibusdam et aut officiis debitis aut rerum necessitatibus saepe eveniet, ut et voluptates repudiandae sint et molestiae non-recusandae. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat
+
+{% include 'chart.html' %}
+
+{% include 'piechart.html' %}
+
+{% include 'map.html' %}
+
+Add more analysis here
+
+
+{% include 'supplementary_header.html' %}
+
+{% include 'citations.html' %}
+
+
+### Additional Information
+
+- The dataset author spoke about his research at the CVPR conference in 2016 <https://www.youtube.com/watch?v=Nl2fBKxwusQ>
+
+
+### Footnotes
+
+[^readme]: "readme.txt" https://exhibits.stanford.edu/data/catalog/sx925dc9385.
+[^localized_region_context]: Li, Y. and Dou, Y. and Liu, X. and Li, T. Localized Region Context and Object Feature Fusion for People Head Detection. ICIP16 Proceedings. 2016. Pages 594-598.
+[^replacement_algorithm]: Zhao. X, Wang Y, Dou, Y. A Replacement Algorithm of Non-Maximum Suppression Base on Graph Clustering. \ No newline at end of file
diff --git a/site/content/pages/datasets/uccs/assets/uccs_bboxes_clr_fill.jpg b/site/content/pages/datasets/uccs/assets/uccs_bboxes_clr_fill.jpg
deleted file mode 100644
index c8002bb9..00000000
--- a/site/content/pages/datasets/uccs/assets/uccs_bboxes_clr_fill.jpg
+++ /dev/null
Binary files differ
diff --git a/site/content/pages/datasets/uccs/assets/uccs_bboxes_grayscale.jpg b/site/content/pages/datasets/uccs/assets/uccs_bboxes_grayscale.jpg
deleted file mode 100644
index 6e2833dd..00000000
--- a/site/content/pages/datasets/uccs/assets/uccs_bboxes_grayscale.jpg
+++ /dev/null
Binary files differ
diff --git a/site/content/pages/datasets/uccs/assets/uccs_mean_bboxes_comp.jpg b/site/content/pages/datasets/uccs/assets/uccs_mean_bboxes_comp.jpg
new file mode 100644
index 00000000..18f4c5ec
--- /dev/null
+++ b/site/content/pages/datasets/uccs/assets/uccs_mean_bboxes_comp.jpg
Binary files differ
diff --git a/site/content/pages/datasets/uccs/index.md b/site/content/pages/datasets/uccs/index.md
index 092638c0..8ae1f324 100644
--- a/site/content/pages/datasets/uccs/index.md
+++ b/site/content/pages/datasets/uccs/index.md
@@ -2,11 +2,12 @@
status: published
title: Unconstrained College Students
-desc: <span class="dataset-name">Unconstrained College Students (UCCS)</span> is a dataset of images ...
-subdesc: The UCCS dataset includes ...
slug: uccs
+desc: <span class="dataset-name">Unconstrained College Students (UCCS)</span> is a dataset of long-range surveillance photos of students taken without their knowledge
+subdesc: The UCCS dataset includes 16,149 images and 1,732 identities of students at University of Colorado Colorado Springs campus and is used for face recognition and face detection
cssclass: dataset
image: assets/background.jpg
+slug: uccs
published: 2019-2-23
updated: 2019-2-23
authors: Adam Harvey
@@ -15,30 +16,75 @@ authors: Adam Harvey
### sidebar
-+ Collected: TBD
-+ Published: TBD
-+ Images: TBD
-+ Faces: TBD
++ Published: 2018
++ Images: 16,149
++ Identities: 1,732
++ Used for: Face recognition, face detection
++ Created by: Unviversity of Colorado Colorado Springs (US)
++ Funded by: ODNI, IARPA, ONR MURI, Amry SBIR, SOCOM SBIR
++ Website: <a href="https://vast.uccs.edu/Opensetface/">vast.uccs.edu</a>
## Unconstrained College Students ...
(PAGE UNDER DEVELOPMENT)
+Unconstrained College Students (UCCS) is a dataset of long-range surveillance photos captured at University of Colorado Colorado Springs. According to the authors of two papers associated with the dataset, subjects were "photographed using a long-range high-resolution surveillance camera without their knowledge" [^funding_sb]. The images were captured using a Canon 7D digital camera fitted with a Sigma 800mm telephoto lens pointed out the window of an office.
+
+The UCCS dataset was funded by ODNI (Office of Director of National Intelligence), IARPA (Intelligence Advance Research Projects Activity), ONR MURI Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative, Army SBIR (Small Business Innovation Research), SOCOM SBIR (Special Operations Command and Small Business Innovation Research), and the National Science Foundation.
+
+The images in UCCS include students walking between classes on campus over 19 days in 2012 - 2013. The dates include:
+
+| Year | Month | Day | Date | Time Range | Photos |
+| --- | --- | --- | --- | --- | --- |
+| 2012 | Februay | --- | 23 | - | 132 |
+| 2012 | March | --- | 6 | - | - |
+| 2012 | March | --- | 8 | - | - |
+| 2012 | March | --- | 13 | - | - |
+| 2012 | Februay | --- | 23 | - | 132 |
+| 2012 | March | --- | 6 | - | - |
+| 2012 | March | --- | 8 | - | - |
+| 2012 | March | --- | 13 | - | - |
+| 2012 | Februay | --- | 23 | - | 132 |
+| 2012 | March | --- | 6 | - | - |
+| 2012 | March | --- | 8 | - | - |
+| 2012 | March | --- | 13 | - | - |
+| 2012 | Februay | --- | 23 | - | 132 |
+| 2012 | March | --- | 6 | - | - |
+| 2012 | March | --- | 8 | - | - |
+| 2012 | March | --- | 13 | - | - |
+| 2012 | Februay | --- | 23 | - | 132 |
+| 2012 | March | --- | 6 | - | - |
+| 2012 | March | --- | 8 | - | - |
+
+
+2012-03-20
+2012-03-22
+2012-04-03
+2012-04-12
+2012-04-17
+2012-04-24
+2012-04-25
+2012-04-26
+2013-01-28
+2013-01-29
+2013-02-13
+2013-02-19
+2013-02-20
+2013-02-26
+
+![caption: The pixel-average of all Uconstrained College Students images is shown with all 51,838 face annotations. (c) Adam Harvey](assets/uccs_mean_bboxes_comp.jpg)
+
+
{% include 'map.html' %}
{% include 'chart.html' %}
{% include 'piechart.html' %}
-{% include 'supplementary_header.html' %}
-
{% include 'citations.html' %}
-
-![Bounding box visualization](assets/uccs_bboxes_grayscale.jpg)
-
-### Research Notes
+{% include 'supplementary_header.html' %}
The original Sapkota and Boult dataset, from which UCCS is derived, received funding from[^funding_sb]:
@@ -53,6 +99,14 @@ The more recent UCCS version of the dataset received funding from [^funding_uccs
- IARPA (Intelligence Advance Research Projects Activity) R&D contract 2014-14071600012
+### TODO
+
+- add tabulator module for dates
+- parse dates into CSV using Python
+- get google image showing line of sight?
+- fix up quote/citations
+
+### footnotes
[^funding_sb]: Sapkota, Archana and Boult, Terrance. "Large Scale Unconstrained Open Set Face Database." 2013.
[^funding_uccs]: Günther, M. et. al. "Unconstrained Face Detection and Open-Set Face Recognition Challenge," 2018. Arxiv 1708.02337v3. \ No newline at end of file