# Data Dictionary — House Music Intelligence Database

> The field-level contract for every dataset and API response.

## Provenance model

Every important field is backed by a `field_sources[field]` object:

```
{
  "value": "South Africa",
  "source_name": "Wikipedia",
  "source_url": "https://en.wikipedia.org/wiki/Black_Coffee_(DJ)",
  "last_verified_date": "2026-06-08",
  "confidence_score": 88
}
```

- **confidence_score** — 0-100. Higher = more corroborated / more authoritative source.
- **last_verified_date** — ISO date the value was last confirmed against its source.

## artists (CSV columns / JSON keys)

| Field | Type | Notes |
| --- | --- | --- |
| slug | string | stable identifier, used in all URLs |
| artist_name | string | display name |
| real_name | string | if public |
| origin_city / origin_country | string | birthplace / scene origin |
| current_city / current_country | string | if public |
| primary_scene | string | e.g. "Afro House", "Chicago House" |
| genres | string[] | normalized to the controlled taxonomy |
| specific_house_subgenres | string[] | controlled taxonomy |
| labels_affiliated / record_label_owned | string[] / string | |
| booking_agency / booking_email / manager_email / press_email | string | only if public |
| website, instagram, spotify, soundcloud, beatport, discogs, resident_advisor, wikipedia, … | url | official/public links |
| monthly_listeners, spotify_followers, instagram_followers | number | approximate, sourced |
| top_releases | object[] | { title, label, release_date, type } |
| scores | object | see Scores below |
| source_urls | url[] | all sources backing the record |
| field_sources | object | per-field provenance (see above) |
| confidence_score | number | overall record confidence 0-100 |
| last_verified_date | ISO date | |
| suggested_citation | string | ready-to-use citation |

## Scores (0-100 each)

influence_score, reach_score, house_crew_relevance_score, booking_potential_score,
emerging_artist_score, contact_confidence_score, citation_confidence_score,
agent_readability_score, brand_fit_score, interview_potential_score,
sponsorship_potential_score, education_value_score, house_crew_priority_score.

## Genre taxonomy

See [/house-music-genre-taxonomy.md](/house-music-genre-taxonomy.md).
