From df7adaa39604177bca58332354950f003150c707 Mon Sep 17 00:00:00 2001 From: Jules Laplace Date: Fri, 8 Mar 2019 21:29:18 +0100 Subject: put chart on the dataset pages and write a lil copy --- site/content/pages/datasets/brainwash/index.md | 2 +- site/content/pages/datasets/cofw/index.md | 8 ++++++++ site/content/pages/datasets/lfw/index.md | 2 ++ site/includes/chart.html | 11 +++++++++++ site/includes/citations.html | 8 +++++++- site/public/datasets/brainwash/index.html | 2 +- site/public/datasets/cofw/index.html | 2 +- site/public/datasets/lfw/index.html | 2 +- 8 files changed, 32 insertions(+), 5 deletions(-) create mode 100644 site/includes/chart.html diff --git a/site/content/pages/datasets/brainwash/index.md b/site/content/pages/datasets/brainwash/index.md index 83c30be8..c60bdb23 100644 --- a/site/content/pages/datasets/brainwash/index.md +++ b/site/content/pages/datasets/brainwash/index.md @@ -44,11 +44,11 @@ Since it's publication by Stanford in 2015, the Brainwash dataset has appeared i {% include 'map.html' %} - {% include 'supplementary_header.html' %} {% include 'citations.html' %} +{% include 'chart.html' %} ### Additional Information diff --git a/site/content/pages/datasets/cofw/index.md b/site/content/pages/datasets/cofw/index.md index 7a668cec..3b1cdb2b 100644 --- a/site/content/pages/datasets/cofw/index.md +++ b/site/content/pages/datasets/cofw/index.md @@ -44,6 +44,14 @@ Robust face landmark estimation under occlusion +{% include 'map.html' %} + +{% include 'supplementary_header.html' %} + +{% include 'citations.html' %} + +{% include 'chart.html' %} + TODO - replace graphic diff --git a/site/content/pages/datasets/lfw/index.md b/site/content/pages/datasets/lfw/index.md index 7ccbfb0b..8d074501 100644 --- a/site/content/pages/datasets/lfw/index.md +++ b/site/content/pages/datasets/lfw/index.md @@ -52,6 +52,8 @@ The *Names and Faces* dataset was the first face recognition dataset created ent {% include 'citations.html' %} +{% include 'chart.html' %} + ### Commercial Use Add a paragraph about how usage extends far beyond academia into research centers for largest companies in the world. And even funnels into CIA funded research in the US and defense industry usage in China. diff --git a/site/includes/chart.html b/site/includes/chart.html new file mode 100644 index 00000000..63108df1 --- /dev/null +++ b/site/includes/chart.html @@ -0,0 +1,11 @@ +
+

+ This bar chart presents a ranking of the top countries where citations originated. Mouse over individual columns + to see yearly totals. Colors are only assigned to the top 10 overall countries. +

+ +
+ +
+
+
diff --git a/site/includes/citations.html b/site/includes/citations.html index ed54b9b1..a37cc43a 100644 --- a/site/includes/citations.html +++ b/site/includes/citations.html @@ -1,6 +1,12 @@

Citations

-

Add graph showing distribution by country. Add information about how the citations were generated. Add button/link to download CSV

+

+ Citations were collected from Semantic Scholar, a website which aggregates + and indexes research papers. Metadata was extracted from these papers, including extracting names of institutions automatically from PDFs, and then the addresses were geocoded. Data is not yet manually verified, and reflects anytime the paper was cited. Some papers may only mention the dataset in passing, while others use it as part of their research methodology. +

+

+ Add button/link to download CSV +

\ No newline at end of file diff --git a/site/public/datasets/brainwash/index.html b/site/public/datasets/brainwash/index.html index 33a10dde..b52cbca3 100644 --- a/site/public/datasets/brainwash/index.html +++ b/site/public/datasets/brainwash/index.html @@ -32,7 +32,7 @@

Brainwash is a face detection dataset created from the Brainwash Cafe's livecam footage including 11,918 images of "everyday life of a busy downtown cafe 1". The images are used to develop face detection algorithms for the "challenging task of detecting people in crowded scenes" and tracking them.

Before closing in 2017, Brainwash Cafe was a "cafe and laundromat" located in San Francisco's SoMA district. The cafe published a publicy available livestream from the cafe with a view of the cash register, performance stage, and seating area.

Since it's publication by Stanford in 2015, the Brainwash dataset has appeared in several notable research papers. In September 2016 four researchers from the National University of Defense Technology in Changsha, China used the Brainwash dataset for a research study on "people head detection in crowded scenes", concluding that their algorithm "achieves superior head detection performance on the crowded scenes dataset 2". And again in 2017 three researchers at the National University of Defense Technology used Brainwash for a study on object detection noting "the data set used in our experiment is shown in Table 1, which includes one scene of the brainwash dataset 3".

-
 An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
 49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)
49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)

Information Supply Chain

To understand how and where this dataset has been used, organizations using the dataset are plotted below. The data is generated by collecting all citations for all the original research papers associated with the dataset. The PDFs are then converted to text and the organization names are extracted and geocoded. Because of the automated approach to extracting data, not all organizations have been confirmed as using the dataset. This visualization is provided to help locate and confirm usage and will be updated as data noise is reduced.

Academic
Industry
Government
Data is compiled from Semantic Scholar and not yet manually verified.

Supplementary Information

Citations

Add graph showing distribution by country. Add information about how the citations were generated. Add button/link to download CSV

Additional Information

+
 An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The datset contains about 12,000 images. License: Open Data Commons Public Domain Dedication (PDDL)
 49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)
49 of the 11,918 images included in the Brainwash dataset. License: Open Data Commons Public Domain Dedication (PDDL)

Information Supply Chain

To understand how and where this dataset has been used, organizations using the dataset are plotted below. The data is generated by collecting all citations for all the original research papers associated with the dataset. The PDFs are then converted to text and the organization names are extracted and geocoded. Because of the automated approach to extracting data, not all organizations have been confirmed as using the dataset. This visualization is provided to help locate and confirm usage and will be updated as data noise is reduced.

Academic
Industry
Government
Data is compiled from Semantic Scholar and not yet manually verified.

Supplementary Information

Citations

Citations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Metadata was extracted from these papers, including extracting names of institutions automatically from PDFs, and then the addresses were geocoded. Data is not yet manually verified, and reflects anytime the paper was cited. Some papers may only mention the dataset in passing, while others use it as part of their research methodology.

Add button/link to download CSV

This bar chart presents a ranking of the top countries where citations originated. Mouse over individual columns to see yearly totals. Colors are only assigned to the top 10 overall countries.

Additional Information

diff --git a/site/public/datasets/cofw/index.html b/site/public/datasets/cofw/index.html index 82842955..b4addd20 100644 --- a/site/public/datasets/cofw/index.html +++ b/site/public/datasets/cofw/index.html @@ -41,7 +41,7 @@ To increase the number of training images, and since COFW has the exact same la

This research is supported by NSF Grant 0954083 and by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via IARPA R&D Contract No. 2014-14071600012.

https://www.cs.cmu.edu/~peiyunh/topdown/

-

TODO

+

Information Supply Chain

To understand how and where this dataset has been used, organizations using the dataset are plotted below. The data is generated by collecting all citations for all the original research papers associated with the dataset. The PDFs are then converted to text and the organization names are extracted and geocoded. Because of the automated approach to extracting data, not all organizations have been confirmed as using the dataset. This visualization is provided to help locate and confirm usage and will be updated as data noise is reduced.

Academic
Industry
Government
Data is compiled from Semantic Scholar and not yet manually verified.

Supplementary Information

Citations

Citations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Metadata was extracted from these papers, including extracting names of institutions automatically from PDFs, and then the addresses were geocoded. Data is not yet manually verified, and reflects anytime the paper was cited. Some papers may only mention the dataset in passing, while others use it as part of their research methodology.

Add button/link to download CSV

This bar chart presents a ranking of the top countries where citations originated. Mouse over individual columns to see yearly totals. Colors are only assigned to the top 10 overall countries.

TODO

- replace graphic

diff --git a/site/public/datasets/lfw/index.html b/site/public/datasets/lfw/index.html index f224e345..adad8aea 100644 --- a/site/public/datasets/lfw/index.html +++ b/site/public/datasets/lfw/index.html @@ -44,7 +44,7 @@

The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

All 5,379 people in the Labeled Faces in The Wild Dataset. Showing one face per person
All 5,379 people in the Labeled Faces in The Wild Dataset. Showing one face per person

The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

The Names and Faces dataset was the first face recognition dataset created entire from online photos. However, Names and Faces and LFW are not the first face recognition dataset created entirely "in the wild". That title belongs to the UCD dataset. Images obtained "in the wild" means using an image without explicit consent or awareness from the subject or photographer.

-

Information Supply Chain

To understand how and where this dataset has been used, organizations using the dataset are plotted below. The data is generated by collecting all citations for all the original research papers associated with the dataset. The PDFs are then converted to text and the organization names are extracted and geocoded. Because of the automated approach to extracting data, not all organizations have been confirmed as using the dataset. This visualization is provided to help locate and confirm usage and will be updated as data noise is reduced.

Academic
Industry
Government
Data is compiled from Semantic Scholar and not yet manually verified.

Supplementary Information

Citations

Add graph showing distribution by country. Add information about how the citations were generated. Add button/link to download CSV

Commercial Use

+

Information Supply Chain

To understand how and where this dataset has been used, organizations using the dataset are plotted below. The data is generated by collecting all citations for all the original research papers associated with the dataset. The PDFs are then converted to text and the organization names are extracted and geocoded. Because of the automated approach to extracting data, not all organizations have been confirmed as using the dataset. This visualization is provided to help locate and confirm usage and will be updated as data noise is reduced.

Academic
Industry
Government
Data is compiled from Semantic Scholar and not yet manually verified.

Supplementary Information

Citations

Citations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Metadata was extracted from these papers, including extracting names of institutions automatically from PDFs, and then the addresses were geocoded. Data is not yet manually verified, and reflects anytime the paper was cited. Some papers may only mention the dataset in passing, while others use it as part of their research methodology.

Add button/link to download CSV

This bar chart presents a ranking of the top countries where citations originated. Mouse over individual columns to see yearly totals. Colors are only assigned to the top 10 overall countries.

Commercial Use

Add a paragraph about how usage extends far beyond academia into research centers for largest companies in the world. And even funnels into CIA funded research in the US and defense industry usage in China.

Research, text, and graphics ©Adam Harvey / megapixels.cc