95 / One million commits
How to get famous on Hugging Face

The One Million Commits dataset now accessible via Hugging Face, a great resource for machine learning projects relating to source code analyses and commit history studies. Additionally, consider the guidelines in the model card paper for ethical AI development and transparent model reporting.

Timeline

00:00 Intro
01:28 Beer review
02:39 Hugging Face
06:20 One million commits
12:30 Dataset cards
13:37 AI has original sin

Participants

fulldecent
@fulldecent

William Entriken

dtedesco1
@dtedesco1

Daniel Tedesco

037
@037

AKM

t012n4d0
@t012n4d0

???


Episode notes

Edit these notes…
  1. One Million Commits // announcement https://twitter.com/fulldecent/status/1706514152003338322 // https://huggingface.co/datasets/fulldecent/one-million-commits
    1. Take a look at hugging face
    2. First dataset
    3. Various things we can do with the dataset
  2. Will’s first Hugging Face dataset https://twitter.com/fulldecent/status/1703874685446824297
  3. The model card paper: https://arxiv.org/pdf/1810.03993.pdf