summaryrefslogtreecommitdiff
path: root/site/content/pages/datasets/uccs/index.md
blob: 68fff4db0713ac0ecb0e37f2b2cc800151805666 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
------------

status: published
title: UnConstrained College Students
slug: uccs
desc: <span class="dataset-name">UnConstrained College Students</span> is a dataset of long-range surveillance photos of students on University of Colorado in Colorado Springs campus
subdesc: The UnConstrained College Students dataset includes 16,149 images of 1,732 students, faculty, and pedestrians and is used for developing face recognition and face detection algorithms
image: assets/background.jpg
cssclass: dataset
image: assets/background.jpg
slug: uccs
published: 2019-2-23
updated: 2019-4-15
authors: Adam Harvey

------------

## UnConstrained College Students

### sidebar
### end sidebar

UnConstrained College Students (UCCS) is a dataset of long-range surveillance photos captured at University of Colorado Colorado Springs developed primarily for research and development of "face detection and recognition research towards surveillance applications"[^uccs_vast]. According to the authors of two papers associated with the dataset, over 1,700 students and pedestrians were "photographed using a long-range high-resolution surveillance camera without their knowledge".[^funding_uccs] In this investigation, we examine the contents of the dataset, funding sources, photo EXIF data, and information from publicly available research project citations.


The UCCS dataset includes over 1,700 unique identities, most of which are students walking to and from class. As of 2018, it was the "largest surveillance [face recognition] benchmark in the public domain."[^surv_face_qmul] The photos were taken during the spring semesters of 2012 &ndash; 2013 on the West Lawn of the University of Colorado Colorado Springs campus. The photographs were timed to capture students during breaks between their scheduled classes in the morning and afternoon during Monday through Thursday. "For example, a student taking Monday-Wednesday classes at 12:30 PM will show up in the camera on almost every Monday and Wednesday."[^sapkota_boult]. 


![caption: Example images from the UnConstrained College Students Dataset. ](assets/uccs_grid.jpg)

The long-range surveillance images in the UnContsrained College Students dataset were captured using a Canon 7D 18 megapixel digital camera fitted with a Sigma 800mm F5.6 EX APO DG HSM telephoto lens and pointed out an office window across the university's West Lawn. The students were photographed from a distance of approximately 150 meters through an office window. "The camera [was] programmed to start capturing images at specific time intervals between classes to maximize the number of faces being captured."[^sapkota_boult]
Their setup made it impossible for students to know they were being photographed, providing the researchers with realistic surveillance images to help build face detection and recognition systems for real world applications in defense, intelligence, and commercial applications.

![caption: The location at University of Colorado Colorado Springs where students were surreptitiously photographed with a long-range surveillance camera for use in a defense and intelligence agency funded research project on face recognition. Image: Google Maps](assets/uccs_map_aerial.jpg)

In the two papers associated with the release of the UCCS dataset ([Unconstrained Face Detection and Open-Set Face Recognition Challenge](https://www.semanticscholar.org/paper/Unconstrained-Face-Detection-and-Open-Set-Face-G%C3%BCnther-Hu/d4f1eb008eb80595bcfdac368e23ae9754e1e745) and [Large Scale Unconstrained Open Set Face Database](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1)), the researchers disclosed their funding sources as ODNI (United States Office of Director of National Intelligence), IARPA (Intelligence Advance Research Projects Activity), ONR MURI (Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative), Army SBIR (Small Business Innovation Research), SOCOM SBIR (Special Operations Command and Small Business Innovation Research), and the National Science Foundation. Further, UCCS's VAST site explicity [states](https://vast.uccs.edu/project/iarpa-janus/) they are part of the [IARPA Janus](https://www.iarpa.gov/index.php/research-programs/janus), a face recognition project developed to serve the needs of national intelligence interests.

The EXIF data embedded in the images shows that the photo capture times follow a similar pattern, but also highlights that the vast majority of photos (over 7,000) were taken on Tuesdays around noon during students' lunch break. The lack of any photos taken on Friday shows that the researchers were only interested in capturing images of students.

![caption: UCCS photos captured per weekday &copy; megapixels.cc](assets/uccs_exif_plot_days.png)

![caption: UCCS photos captured per weekday &copy; megapixels.cc](assets/uccs_exif_plot.png)

The two research papers associated with the release of the UCCS dataset ([Unconstrained Face Detection and Open-Set Face Recognition Challenge](https://www.semanticscholar.org/paper/Unconstrained-Face-Detection-and-Open-Set-Face-G%C3%BCnther-Hu/d4f1eb008eb80595bcfdac368e23ae9754e1e745) and [Large Scale Unconstrained Open Set Face Database](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1)), acknowledge that the primary funding sources for their work were United States defense and intelligence agencies. Specifically, development of the UnContrianed College Students dataset was funded by the Intelligence Advanced Research Projects Activity (IARPA), Office of Director of National Intelligence (ODNI), Office of Naval Research and The Department of Defense Multidisciplinary University Research Initiative (ONR MURI), Small Business Innovation Research (SBIR), Special Operations Command and Small Business Innovation Research (SOCOM SBIR), and the National Science Foundation. Further, UCCS's VAST site explicitly [states](https://vast.uccs.edu/project/iarpa-janus/) they are part of the [IARPA Janus](https://www.iarpa.gov/index.php/research-programs/janus), a face recognition project developed to serve the needs of national intelligence interests, clearly establishing the the funding sources and immediate benefactors of this dataset are United States defense and intelligence agencies.


Although the images were first captured in 2012 &ndash; 2013 the dataset was not publicly released until 2016. Then in 2017 the UCCS face dataset formed the basis for a defense and intelligence agency funded [face recognition challenge](http://www.face-recognition-challenge.com/) project at the International Joint Biometrics Conference in Denver, CO. And in 2018 the dataset was again used for the [2nd Unconstrained Face Detection and Open Set Recognition Challenge](https://erodner.github.io/ial2018eccv/) at the European Computer Vision Conference (ECCV) in Munich, Germany. 

As of April 15, 2019, the UCCS dataset is no longer available for public download. But during the three years it was publicly available (2016-2019) the UCCS dataset appeared in at least 6 publicly available research papers including verified usage from Beihang University who is known to provide research and development for China's military.




{% include 'dashboard.html' %}

{% include 'supplementary_header.html' %}


To show the types of face images used in the UCCS student dataset while protecting their individual privacy, a generative adversarial network was used to interpolate between identities in the dataset. The image below shows a generative adversarial network trained on the UCCS face bounding box areas from 16,000 images and over 90,000 face regions.

![caption: GAN generated approximations of students in the UCCS dataset. &copy; megapixels.cc 2018](assets/uccs_pgan_01.jpg)


=== columns 2

#### UCCS photos taken in 2012

| Date  | Photos |
| --- | --- |
| Feb 23, 2012 | 132 |
| March 6, 2012 | 288 |
| March 8, 2012 | 506 |
| March 13, 2012 | 160 |
| March 20, 2012 | 1,840 |
| March 22, 2012 | 445 |
| April 3, 2012 | 1,639 |
| April 12, 2012 | 14 |
| April 17, 2012 | 19 |
| April 24, 2012 | 63 |
| April 25, 2012 | 11 |
| April 26, 2012 | 20 |

===========

#### UCCS photos taken in 2013

| Date  | Photos |
| --- | --- |
| Jan 28, 2013 | 1,056 |
| Jan 29, 2013 | 1,561 |
| Feb 13, 2013 | 739 |
| Feb 19, 2013 | 723 |
| Feb 20, 2013 | 965 |
| Feb 26, 2013 | 736 |

=== end columns


### Location

The location of the camera and subjects can confirmed using several visual cues in the dataset images: the unique pattern of the sidewalk that is only used on the UCCS Pedestrian Spine near the West Lawn, the two UCCS sign poles with matching graphics still visible in Google Street View, the no parking sign and directionality of its arrow, the back of street sign next to it, the slight bend in the sidewalk, the presence of cars passing in the background of the image, and the far wall of the parking garage all match images in the dataset. The [original papers](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1) also provides another clue: a [picture of the camera](https://www.semanticscholar.org/paper/Large-scale-unconstrained-open-set-face-database-Sapkota-Boult/07fcbae86f7a3ad3ea1cf95178459ee9eaf77cb1/figure/1) inside the office that was used to create the dataset. The window view in this image provides another match for the brick pattern on the north facade of the Kraember Family Library and the green metal fence along the sidewalk. View the [location on Google Maps](https://www.google.com/maps/place/University+of+Colorado+Colorado+Springs/@38.8934297,-104.7992445,27a,35y,258.51h,75.06t/data=!3m1!1e3!4m5!3m4!1s0x87134fa088fe399d:0x92cadf3962c058c4!8m2!3d38.8968312!4d-104.8049528)

![caption: 3D view showing the angle of view of the surveillance camera used for UCCS dataset. Image: Google Maps](assets/uccs_map_3d.jpg)


### Funding

The UnConstrained College Students dataset is associated with two main research papers: "Large Scale Unconstrained Open Set Face Database" and "Unconstrained Face Detection and Open-Set Face Recognition Challenge". Collectively, these papers and the creation of the dataset have received funding from the following organizations:

- ONR (Office of Naval Research) MURI (The Department of Defense Multidisciplinary University Research Initiative) grant N00014-08-1-0638
- Army SBIR (Small Business Innovation Research) grant W15P7T-12-C-A210
- SOCOM (Special Operations Command) SBIR (Small Business Innovation Research) grant H92222-07-P-0020
- National Science Foundation Grant IIS-1320956
- ODNI (Office of Director of National Intelligence)
- IARPA (Intelligence Advance Research Projects Activity) R&D contract 2014-14071600012

### Opting Out

If you attended University of Colorado Colorado Springs and were captured by the long range surveillance camera used to create this dataset, there is unfortunately currently no way to be removed. The authors do not provide any options for students to opt-out nor were students informed they would be used for training face recognition. According to the authors, the lack of any consent or knowledge of participation is what provides part of the value of Unconstrained College Students Dataset.

### Ethics

- Please direct any questions about the ethics of the dataset to the University of Colorado Colorado Springs [Ethics and Compliance Office](https://www.uccs.edu/compliance/)
- For further technical information about the UnConstrained College Students dataset, visit the [UCCS dataset project page](https://vast.uccs.edu/Opensetface). 

### Downloads

- Download EXIF data for UCCS photos: [uccs_camera_exif.csv](https://nyc3.digitaloceanspaces.com/megapixels/v1/datasets/uccs/assets/uccs_camera_exif.csv)

{% include 'cite_our_work.html' %}

### Footnotes

[^uccs_vast]: "2nd Unconstrained Face Detection and Open Set Recognition Challenge." <https://vast.uccs.edu/Opensetface/>. Accessed April 15, 2019.
[^sapkota_boult]: Sapkota, Archana and Boult, Terrance. "Large Scale Unconstrained Open Set Face Database." 2013.
[^funding_uccs]: Günther, M. et. al. "Unconstrained Face Detection and Open-Set Face Recognition Challenge," 2018. Arxiv 1708.02337v3.
[^surv_face_qmul]: "Surveillance Face Recognition Challenge". [SemanticScholar](https://www.semanticscholar.org/paper/Surveillance-Face-Recognition-Challenge-Cheng-Zhu/2306b2a8fba28539306052764a77a0d0f5d1236a)