The VGG Face 2 dataset includes approximately 1,331 actresses, 139 presidents, 16 wives, 3 husbands, 2 snooker player, and 1 guru
Names and descriptions
The original VGGF2 name list has been updated with the results returned from Google Knowledge
Names with a similarity score greater than 0.75 where automatically updated. Scores computed using import difflib; seq = difflib.SequenceMatcher(a=a.lower(), b=b.lower()); score = seq.ratio()
The 97 names with a score of 0.75 or lower were manually reviewed and includes name changes validating using Wikipedia.org results for names such as "Bruce Jenner" to "Caitlyn Jenner", spousal last-name changes, and discretionary changes to improve search results such as combining nicknames with full name when appropriate, for example changing "Aleksandar Petrović" to "Aleksandar 'Aco' Petrović" and minor changes such as "Mohammad Ali" to "Muhammad Ali"
The 'Description' text was automatically added when the Knowledge Graph score was greater than 250
TODO
create name list, and populate with Knowledge graph information like LFW
make list of interesting number stats, by the numbers
make list of interesting important facts
write intro abstract
write analysis of usage
find examples, citations, and screenshots of useage