<!doctype html>
<html>
<head>
<title>MegaPixels</title>
<meta charset="utf-8" />
<meta name="author" content="Adam Harvey" />
<meta name="description" content="A large scale image dataset for face recognition" />
<meta name="referrer" content="no-referrer" />
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes" />
<link rel='stylesheet' href='/assets/css/fonts.css' />
<link rel='stylesheet' href='/assets/css/tabulator.css' />
<link rel='stylesheet' href='/assets/css/css.css' />
<link rel='stylesheet' href='/assets/css/leaflet.css' />
<link rel='stylesheet' href='/assets/css/applets.css' />
</head>
<body>
<header>
<a class='slogan' href="/">
<div class='logo'></div>
<div class='site_name'>MegaPixels</div>
<span class='sub'>The Darkside of Datasets</span>
</a>
<div class='links'>
<a href="/datasets/">Datasets</a>
<a href="/research/">Research</a>
<a href="/about/">About</a>
</div>
</header>
<div class="content">
<section><h1>VGG Face2</h1>
</section><section><div class='meta'><div><div class='gray'>Created</div><div>2018</div></div><div><div class='gray'>Images</div><div>3.3M</div></div><div><div class='gray'>People</div><div>9,000</div></div><div><div class='gray'>Created From</div><div>Scraping search engines</div></div><div><div class='gray'>Search available</div><div><a href="#">Searchable</a></div></div></div></section><section><p>VGG Face2 is the updated version of the VGG Face dataset and includes over 3.3M face images of more than 9K people. The identities were selected by taking the top 500K identities in Google's Knowledge Graph of celebrities and then keeping only the names that yielded enough training images. The dataset was created in the UK but funded by the Office of the Director of National Intelligence in the United States.</p>
</section><section class='applet_container'><div class='applet' data-payload='{"command": "face_search"}'></div></section><section class='applet_container'><div class='applet' data-payload='{"command": "name_search"}'></div></section><section class='applet_container'><div class='applet' data-payload='{"command": "load_file assets/lfw_names_gender_kg_min.csv", "fields": ["Name, Images, Gender, Description"]}'></div></section><section><h2>VGG Face2 by the Numbers</h2>
<ul>
<li>1,331 actresses, 139 presidents</li>
<li>3 husbands and 16 wives</li>
<li>2 snooker players</li>
<li>1 guru</li>
<li>1 pornographic actress</li>
<li>3 computer programmers</li>
</ul>
<h1>Names and descriptions</h1>
<ul>
<li>The original VGG Face2 name list has been updated with the results returned from the Google Knowledge Graph.</li>
<li>Names with a similarity score greater than 0.75 were automatically updated. Scores were computed using <code>import difflib; seq = difflib.SequenceMatcher(a=a.lower(), b=b.lower()); score = seq.ratio()</code>.</li>
<li>The 97 names with a score of 0.75 or lower were manually reviewed. The resulting changes include name changes validated against Wikipedia.org results, such as "Bruce Jenner" to "Caitlyn Jenner"; spousal last-name changes; discretionary changes to improve search results, such as combining a nickname with the full name when appropriate (for example, changing "Aleksandar Petrović" to "Aleksandar 'Aco' Petrović"); and minor changes such as "Mohammad Ali" to "Muhammad Ali".</li>
<li>The 'Description' text was automatically added when the Knowledge Graph score was greater than 250</li>
</ul>
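<p>The scoring rule described in the list above can be sketched as follows. This is only an illustration built on the <code>difflib</code> snippet quoted there: the function names and the <code>AUTO_UPDATE_THRESHOLD</code> constant are hypothetical and are not taken from the project's code.</p>

```python
import difflib

# Assumed cutoff, taken from the 0.75 threshold mentioned in the text.
AUTO_UPDATE_THRESHOLD = 0.75


def name_similarity(a, b):
    """Return the case-insensitive similarity ratio of two names (0.0 to 1.0)."""
    seq = difflib.SequenceMatcher(a=a.lower(), b=b.lower())
    return seq.ratio()


def needs_manual_review(original, candidate):
    """Return True when the score is 0.75 or lower, so no automatic update applies."""
    return not name_similarity(original, candidate) > AUTO_UPDATE_THRESHOLD
```

<p>Under these assumptions, a pair of names that differs only in letter case scores 1.0 and would be updated automatically, while a pair with no characters in common scores 0.0 and would be routed to manual review.</p>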
<h1>TODO</h1>
<ul>
<li>create name list, and populate with Knowledge Graph information like LFW</li>
<li>make list of interesting number stats, by the numbers</li>
<li>make list of interesting important facts</li>
<li>write intro abstract</li>
<li>write analysis of usage</li>
<li>find examples, citations, and screenshots of usage</li>
<li>find list of companies using it for table</li>
<li>create montages of the dataset, like LFW</li>
<li>create right to removal information</li>
</ul>
</section>
</div>
<footer>
<div>
<a href="/">MegaPixels.cc</a>
<a href="/about/disclaimer/">Disclaimer</a>
<a href="/about/terms/">Terms of Use</a>
<a href="/about/privacy/">Privacy</a>
<a href="/about/">About</a>
<a href="/about/team/">Team</a>
</div>
<div>
MegaPixels ©2017-19 Adam R. Harvey /
<a href="https://ahprojects.com">ahprojects.com</a>
</div>
</footer>
</body>
<script src="/assets/js/dist/index.js"></script>
</html>