# Data Sources

> The 53 sources the House Music Intelligence Database is built from. API-first, compliance-first: official APIs where available; robots.txt, rate limits and ToS respected; public data only.

- **Canonical URL:** https://database.worldfamoushousecrew.org/sources.md
- **Publisher:** World Famous House Crew
- **Last updated:** June 23, 2026
- **Machine-readable index:** https://database.worldfamoushousecrew.org/llms.txt

| Source | Type | API | Automated access | Rate limit | Notes |
| --- | --- | --- | --- | --- | --- |
| Beatport | catalog | no | manual/per-site | n/a | No public API for V1. Use public chart pages only where robots.txt permits; otherwise manual. |
| Discogs | catalog | yes | via API | 60/min (authenticated) | Use official API + token. Respect 429s. |
| MusicBrainz | catalog | yes | via API | 1 req/sec | Requires descriptive User-Agent with contact. |
| Traxsource | catalog | no | manual/per-site | n/a | Public charts only; check robots.txt. |
| Wikidata | catalog | yes | allowed | fair use | SPARQL endpoint. Great for cross-IDs. |
| Beatport Charts | charts | no | manual/per-site | n/a | Discovery seed for top tracks/artists. |
| Shazam Charts | charts | no | manual/per-site | n/a | Trend signal; limited access. |
| Traxsource Charts | charts | no | manual/per-site | n/a | Discovery seed for soulful/deep/Afro house. |
| Reddit communities | community | yes | via API | per Reddit API terms | r/house, r/DeepHouse etc. via official API. |
| Bandsintown | events | yes | via API | fair use | Needs app_id. |
| Dice | events | no | manual/per-site | n/a | No public API; public event pages only where permitted. |
| EDM Train | events | yes | via API | fair use | API available with key. |
| Eventbrite | events | yes | via API | fair use | Official API with OAuth token. |
| Facebook event pages | events | no | manual/per-site | n/a | Public event info only; no login/scraping of private data. |
| Festival websites | events | no | manual/per-site | per-site | Category: lineup pages of major festivals; per-site robots.txt + ToS. |
| Promoter websites | events | no | manual/per-site | per-site | Category: promoter event listings; per-site compliance. |
| Resident Advisor | events | no | manual/per-site | n/a | GraphQL endpoint exists but is unofficial; respect robots.txt and ToS. |
| Songkick | events | yes | via API | fair use | Needs API key (request access). |
| Venue calendars | events | no | manual/per-site | per-site | Category: club/venue event calendars; per-site compliance. |
| Afro House labels | label | no | manual/per-site | per-site | Category: Afro House labels (e.g. MoBlack, Get Physical AfroBros, Offering Recordings). |
| Anjunadeep | label | no | manual/per-site | per-site | Roster + demo policy. |
| Defected Records | label | no | manual/per-site | per-site | Roster + release pages. |
| Glitterbox | label | no | manual/per-site | per-site | Defected sub-brand. |
| Kaoz Theory | label | no | manual/per-site | per-site | Kerri Chandler label. |
| Nervous Records | label | no | manual/per-site | per-site | NYC house catalog. |
| Spinnin Deep | label | no | manual/per-site | per-site | Spinnin sub-label. |
| Strictly Rhythm | label | no | manual/per-site | per-site | NYC house catalog. |
| Toolroom Records | label | no | manual/per-site | per-site | Roster + demo policy. |
| Beacons | links | no | manual/per-site | n/a | Public link-in-bio pages. |
| Hypeddit | links | no | manual/per-site | n/a | Marketing/link pages. |
| Linktree | links | no | manual/per-site | n/a | Public link-in-bio pages; extract official links only. |
| ToneDen | links | no | manual/per-site | n/a | Marketing/link pages. |
| Apple Music Playlists | playlists | yes | via API | fair use | Discovery via Apple editorial playlists (MusicKit). |
| Spotify Playlists | playlists | yes | via API | rolling window | Discovery via editorial house playlists (Spotify API). |
| Boiler Room | press | no | manual/per-site | n/a | Set/interview links; manual or via YouTube. |
| DJ Mag | press | no | manual/per-site | n/a | Press mentions / Top 100; manual citation. |
| Mixmag | press | no | manual/per-site | n/a | Press mentions; manual citation. |
| Wikipedia | press | yes | allowed | fair use | MediaWiki API. |
| AllMusic | reference | no | manual/per-site | n/a | No public API; reference only. |
| Google Search | search | yes | via API | quota/day | Programmable Search Engine / SerpAPI with advanced operators for discovery. |
| Instagram | social | no | manual/per-site | n/a | Collect only public profile URLs linked from official artist/label pages. No login, no scraping of private data. |
| TikTok | social | no | manual/per-site | n/a | Public profile URLs only; no private data. |
| Apple Music | streaming | yes | via API | fair use | MusicKit API — requires Apple developer token. |
| Bandcamp | streaming | no | manual/per-site | n/a | No general public API; link-only. |
| Last.fm | streaming | yes | via API | fair use | Needs API key. |
| SoundCloud | streaming | yes | via API | varies | API registration is gated; link-only in V1. |
| Spotify | streaming | yes | via API | rolling 30s window | Client-credentials flow. No monthly-listener field in API (use followers/popularity). |
| YouTube | streaming | yes | via API | quota units/day | YouTube Data API v3. |
| YouTube Music | streaming | no | manual/per-site | n/a | No official API; reachable via YouTube Data API for some data. |
| 1001Tracklists | tracklists | no | manual/per-site | n/a | No public API and strict ToS; use only with permission / manual. |
| Artist websites | web | no | manual/per-site | per-site | Category: official artist sites — booking/management/contact links. |
| Booking agency websites | web | no | manual/per-site | per-site | Category: agency rosters + booking contacts. |
| Label websites | web | no | manual/per-site | per-site | Category: official label sites — roster/demo/contact. |

## How to cite

```
House Music Intelligence Database. "Data Sources." Published by World Famous House Crew. Last verified June 23, 2026. URL: https://database.worldfamoushousecrew.org/sources.md
```
