summaryrefslogtreecommitdiff
path: root/site/content/pages/datasets/uccs/index.md
blob: 767f822057ea042cf3427f93f3e08bee33949502 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
------------

status: published
title: UnConstrained College Students
slug: uccs
desc: <span class="dataset-name">UnConstrained College Students</span> is a dataset of long-range surveillance photos of students on University of Colorado in Colorado Springs campus
subdesc: The UnConstrained College Students dataset includes 16,149 images of 1,732 students, faculty, and pedestrians and is used for developing face recognition and face detection algorithms
image: assets/background.jpg
cssclass: dataset
image: assets/background.jpg
slug: uccs
published: 2019-2-23
updated: 2019-4-15
authors: Adam Harvey

------------

### sidebar
### end sidebar

## UnConstrained College Students

UnConstrained College Students (UCCS) is a dataset of long-range surveillance photos captured at University of Colorado Colorado Springs. According to the authors of two papers associated with the dataset, over 1,700 students and pedestrians were "photographed using a long-range high-resolution surveillance camera without their knowledge" [^funding_uccs]. In this investigation, we examine the funding sources, contents of the dataset, photo EXIF data, and publicy available research project citations.

According to the author's of the the UnConstrained College Students dataset it is primarliy used for research and development of "face detection and recognition research towards surveillance applications that are becoming more popular and more required nowadays, and where no automatic recognition algorithm has proven to be useful yet." Applications of this technology include usage by defense and intelligence agencies, who were also the primary funding sources of the UCCS dataset.



In the two papers associated with the release of the UCCS datasaet ([Unconstrained Face Detection and Open-Set Face Recognition Challenge](https://www.semanticscholar.org/paper/Unconstrained-Face-Detection-and-Open-Set-Face-G%C3%BCnther-Hu/d4f1eb008eb80595bcfdac368e23ae9754e1e745) and [Large Scale Unconstrained Open Set Face Database](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1)), the researchers disclosed their funding sources as ODNI (United States Office of Director of National Intelligence), IARPA (Intelligence Advance Research Projects Activity), ONR MURI (Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative), Army SBIR (Small Business Innovation Research), SOCOM SBIR (Special Operations Command and Small Business Innovation Research), and the National Science Foundation. Further, UCCS's VAST site explicity [states](https://vast.uccs.edu/project/iarpa-janus/) they are part of the [IARPA Janus](https://www.iarpa.gov/index.php/research-programs/janus), a face recognition project developed to serve the needs of national intelligence interests.

![caption: Location on campus where students were unknowingly photographed with a telephoto lens to be used for defense and intelligence agency funded research on face recognition. Image: Google Maps](assets/uccs_map_aerial.jpg)

The UCCS dataset includes the highest resolution images of any publicly available face recognition dataset discovered so far (18MP) and was, as of 2018, the "largest surveillance FR benchmark in the public domain."[^surv_face_qmul] To create the dataset, the researchers used a Canon 7D digital camera fitted with a Sigma 800mm telephoto lens and photographed students from a distance of 150&ndash;200m through their office window. Photos were taken during the morning and afternoon while students were walking to and from classes. According to an analysis of the EXIF data embedded in the photos, nearly half of the 16,149 photos were taken on Tuesdays. The most popular time was during lunch break. All of the photos were taken during the spring semester in 2012 and 2013 but the dataset was not publicy released until 2016.

In 2017 the UCCS face dataset was used for a defense and intelligence agency funded [face recognition challenge](http://www.face-recognition-challenge.com/) at the International Joint Biometrics Conference in Denver, CO. And in 2018 the dataset was again used for the [2nd Unconstrained Face Detection and Open Set Recognition Challenge](https://erodner.github.io/ial2018eccv/) at the European Computer Vision Conference (ECCV) in Munich, Germany. Additional research projects that have used the UCCS dataset are included below in the list of verified citations.

As of April 15, 2019, the UCCS dataset is no longer available for public download. During the three years it was publicly available (2016-2019) the UCCS dataset apepared in at least 5 publicly available research papers including verified usage from University of Notre Dame (US), Beihang University (China), Beckman Institute (US), Queen Mary University of London (UK), Carnegie Mellon University (US),Karlsruhe Institute of Technology (DE), and Vision Semantics Ltd (UK) who [lists](http://visionsemantics.com/partners.html) the UK Ministry of Defence and Metropolitan Police as partners.

To show the types of face images used in the UCCS student dataset while protecting their individual privacy, a generative adversarial network was used to interpolate between identities in the dataset. The image below shows a generative adversarial network trained on the UCCS face bounding box areas from 16,000 images and over 90,000 face regions.

![caption: GAN generated approximations of students in the UCCS dataset. &copy; megapixels.cc 2018](assets/uccs_pgan_01.jpg)


{% include 'dashboard.html' %}

{% include 'supplementary_header.html' %}

### Dates and Times

The images in UCCS were taken on 18 non-consecutive days during 2012&ndash;2013. Analysis of the [EXIF data](assets/uccs_camera_exif.csv) embedded in original images reveal that most of the images were taken on Tuesdays, and the most frequent capture time throughout the week was 12:30PM.

![caption: UCCS photos captured per weekday &copy; megapixels.cc](assets/uccs_exif_plot_days.png)

![caption: UCCS photos captured per 10-minute intervals per weekday &copy; megapixels.cc](assets/uccs_exif_plot.png)


=== columns 2

#### UCCS photos taken in 2012

| Date  | Photos |
| --- | --- |
| Feb 23, 2012 | 132 |
| March 6, 2012 | 288 |
| March 8, 2012 | 506 |
| March 13, 2012 | 160 |
| March 20, 2012 | 1,840 |
| March 22, 2012 | 445 |
| April 3, 2012 | 1,639 |
| April 12, 2012 | 14 |
| April 17, 2012 | 19 |
| April 24, 2012 | 63 |
| April 25, 2012 | 11 |
| April 26, 2012 | 20 |

===========

#### UCCS photos taken in 2013

| Date  | Photos |
| --- | --- |
| Jan 28, 2013 | 1,056 |
| Jan 29, 2013 | 1,561 |
| Feb 13, 2013 | 739 |
| Feb 19, 2013 | 723 |
| Feb 20, 2013 | 965 |
| Feb 26, 2013 | 736 |

=== end columns


### Location

The location of the camera and subjects can confirmed using several visual cues in the dataset images: the unique pattern of the sidewalk that is only used on the UCCS Pedestrian Spine near the West Lawn, the two UCCS sign poles with matching graphics still visible in Google Street View, the no parking sign and directionality of its arrow, the back of street sign next to it, the slight bend in the sidewalk, the presence of cars passing in the background of the image, and the far wall of the parking garage all match images in the dataset. The [original papers](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1) also provides another clue: a [picture of the camera](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1/figure/1) inside the office that was used to create the dataset. The window view in this image provides another match for the brick pattern on the north facade of the Kraember Family Library and the green metal fence along the sidewalk. View the [location on Google Maps](https://www.google.com/maps/place/University+of+Colorado+Colorado+Springs/@38.8934297,-104.7992445,27a,35y,258.51h,75.06t/data=!3m1!1e3!4m5!3m4!1s0x87134fa088fe399d:0x92cadf3962c058c4!8m2!3d38.8968312!4d-104.8049528)

![caption: 3D view showing the angle of view of the surveillance camera used for UCCS dataset. Image: Google Maps](assets/uccs_map_3d.jpg)


### Funding

The UnConstrained College Students dataset is associated with two main research papers: "Large Scale Unconstrained Open Set Face Database" and "Unconstrained Face Detection and Open-Set Face Recognition Challenge". Collectively, these papers and the creation of the dataset have received funding from the following organizations:

- ONR (Office of Naval Research) MURI (The Department of Defense Multidisciplinary University Research Initiative) grant N00014-08-1-0638
- Army SBIR (Small Business Innovation Research) grant W15P7T-12-C-A210
- SOCOM (Special Operations Command) SBIR (Small Business Innovation Research) grant H92222-07-P-0020
- National Science Foundation Grant IIS-1320956
- ODNI (Office of Director of National Intelligence)
- IARPA (Intelligence Advance Research Projects Activity) R&D contract 2014-14071600012

### Opting Out

If you attended University of Colorado Colorado Springs and were captured by the long range surveillance camera used to create this dataset, there is unfortunately currently no way to be removed. The authors do not provide any options for students to opt-out nor were students informed they would be used for training face recognition. According to the authors, the lack of any consent or knowledge of participation is what provides part of the value of Unconstrained College Students Dataset.

### Ethics

- Please direct any questions about the ethics of the dataset to the University of Colorado Colorado Springs [Ethics and Compliance Office](https://www.uccs.edu/compliance/)
- For further technical information about the dataset, visit the [UCCS dataset project page](https://vast.uccs.edu/Opensetface). 

### Downloads

- Download EXIF data for UCCS photos: [uccs_camera_exif.csv](https://nyc3.digitaloceanspaces.com/megapixels/v1/datasets/uccs/assets/uccs_camera_exif.csv)

{% include 'cite_our_work.html' %}

{% include 'last_updated.html' %}

### Footnotes

[^funding_sb]: Sapkota, Archana and Boult, Terrance. "Large Scale Unconstrained Open Set Face Database." 2013.
[^funding_uccs]: Günther, M. et. al. "Unconstrained Face Detection and Open-Set Face Recognition Challenge," 2018. Arxiv 1708.02337v3.
[^surv_face_qmul]: "Surveillance Face Recognition Challenge". [SemanticScholar](https://www.semanticscholar.org/paper/Surveillance-Face-Recognition-Challenge-Cheng-Zhu/2306b2a8fba28539306052764a77a0d0f5d1236a)