Week 4

Week 4 (July 18th - July 22nd)

July 19th:

In our meeting my Professor advised that I create a full table of CLIP ViT-L/14, ViT-L/14-336px, ViT-B/32, … (i.e. all the open CLIP models published) as well as start creating a directory of dataset for Politicans from the internet.

For our project we decided to have a Politicians dataset which included:

  • 100 mayor from the 100 most populous setings
  • All 436 U.S. House of Representatives members
  • All 100 U.S. Senators

Basically, I went on Wikipedia and bgean downloading all images of politicians and putting them in a folder on the server. In order to download all these images at once I created a python script which pulls images from webpages.

Here are some example photos that where downloaded (images downloaded where in smallest format):

CLIP table CLIP table CLIP table CLIP table

July 21st:

As we brainstormed more we came up with two rough draft ideas 1) Similar to Winoground where we have 2 pictures and 2 captions – and we compute the same metrics as Winoground – This is a picture of a congressperson from Texas – This is a picture of a congressperson from Minnesota 2) Design a different tests like the ones you were doing before with words. Input 1 image – and 6 captions – [This is a picture of a congressperson] [This is a picture of a US representative in the Senate] [This is a picture of a Professor] [This is a picture of an assistant] [This is a picture of a receptionist]

July 22nd:

Today I started going through all the CLIP models and running the Winoground dataset on them. Here is a summary of my results for each different model:

Screen Shot 2022-09-28 at 9 54 00 PM

Upcoming

  • Start uploading all Politicans datasets to the scratch folder
  • Keep Brainstorming Tests for Politician Dataset
  • Start Tests on Politician Dataset
Written on July 18, 2022