From 98385977e777fa18019d975ad160cc5725e9001d Mon Sep 17 00:00:00 2001 From: adamhrv Date: Thu, 2 May 2019 19:57:21 +0200 Subject: fix typos --- site/public/datasets/ijb_c/index.html | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) (limited to 'site/public/datasets/ijb_c') diff --git a/site/public/datasets/ijb_c/index.html b/site/public/datasets/ijb_c/index.html index 3bc23ca5..f58be23f 100644 --- a/site/public/datasets/ijb_c/index.html +++ b/site/public/datasets/ijb_c/index.html @@ -75,7 +75,11 @@
nist.gov

[ page under development ]

The IARPA Janus Benchmark C is a dataset created by

-
 A visualization of the IJB-C dataset
A visualization of the IJB-C dataset
+
 A visualization of the IJB-C dataset
A visualization of the IJB-C dataset

Research notes

+

From original papers: https://noblis.org/wp-content/uploads/2018/03/icb2018.pdf

+

Collection for the dataset began by identifying CreativeCommons subject videos, which are often more scarce thanCreative Commons subject images. Search terms that re-sulted in large quantities of person-centric videos (e.g. “in-terview”) were generated and translated into numerous lan-guages including Arabic, Korean, Swahili, and Hindi to in-crease diversity of the subject pool. Certain YouTube userswho upload well-labeled, person-centric videos, such as the World Economic Forum and the International University Sports Federation were also identified. Titles of videos per-taining to these search terms and usernames were scrapedusing the YouTube Data API and translated into English us-ing the Yandex Translate API4. Pattern matching was per-formed to extract potential names of subjects from the trans-lated titles, and these names were searched using the Wiki-data API to verify the subject’s existence and status as a public figure, and to check for Wikimedia Commons im-agery. Age, gender, and geographic region were collectedusing the Wikipedia API.Using the candidate subject names, Creative Commonsimages were scraped from Google and Wikimedia Com-mons, and Creative Commons videos were scraped fromYouTube. After images and videos of the candidate subjectwere identified, AMT Workers were tasked with validat-ing the subject’s presence throughout the video. The AMTWorkers marked segments of the video in which the subjectwas present, and key frames

+

IARPA funds Italian researcher https://www.micc.unifi.it/projects/glaivejanus/

+

Who used IJB-C?

-- cgit v1.2.3-70-g09d2