Data Sources
The House Music Intelligence Database is built from 53 sources, 17 of which have official APIs. We are API-first and compliance-first: we use official APIs where they exist, respect robots.txt, rate limits, and Terms of Service, and never bypass logins, paywalls, or anti-bot systems. We collect only public information.
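
As an illustration of the robots.txt rule above, the following is a minimal sketch of a pre-fetch check using Python's standard `urllib.robotparser`. The User-Agent string, the helper name `is_allowed`, and the sample robots.txt are assumptions made for this example; they are not taken from the project's code.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical User-Agent; the contact address is a placeholder.
USER_AGENT = "HouseMusicIntelligenceBot/0.1 (contact@example.org)"


def is_allowed(robots_txt: str, url: str, user_agent: str = USER_AGENT) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Sample rules for illustration only; a real check would use the site's own file.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /account/
Allow: /charts/
"""

print(is_allowed(SAMPLE_ROBOTS, "https://example.com/charts/top-100"))
print(is_allowed(SAMPLE_ROBOTS, "https://example.com/account/settings"))
```

In practice the robots.txt text would be downloaded once per site and cached. A `False` result means the source falls back to manual collection. Passing robots.txt does not replace a Terms of Service review.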
catalog
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Beatport | — | manual / per-site | n/a | idle | No public API for V1. Use public chart pages only where robots.txt permits; otherwise manual. |
| Discogs | API | via API | 60/min (authenticated) | idle | Use official API + token. Respect 429s. |
| MusicBrainz | API | via API | 1 req/sec | ok · June 22, 2026 | Requires descriptive User-Agent with contact. |
| Traxsource | — | manual / per-site | n/a | idle | Public charts only; check robots.txt. |
| Wikidata | API | ✅ allowed | fair use | ok · June 22, 2026 | SPARQL endpoint. Great for cross-IDs. |
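
The limits in the table above (MusicBrainz at 1 req/sec, Discogs at 60/min with 429 responses respected) could be enforced client-side roughly as follows. This is a sketch under stated assumptions: `MinIntervalThrottle` and `backoff_delay` are hypothetical helpers, the response headers are assumed to arrive as a plain dict with a `Retry-After` key, and the exponential fallback is an illustrative choice rather than either API's documented policy.

```python
import time


class MinIntervalThrottle:
    """Space out calls so that at most one happens per `min_interval` seconds."""

    def __init__(self, min_interval: float, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock      # injectable for testing
        self._sleep = sleep      # injectable for testing
        self._last_call = None   # time of the previous permitted call, if any

    def wait(self) -> float:
        """Block until the next call is permitted; return the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._last_call is not None:
            delay = max(0.0, self.min_interval - (now - self._last_call))
        if delay > 0:
            self._sleep(delay)
        self._last_call = now + delay
        return delay


def backoff_delay(status: int, headers: dict, attempt: int, base: float = 1.0):
    """Return seconds to wait before retrying, or None if no retry is needed.

    Only HTTP 429 triggers a retry here. A numeric Retry-After header wins;
    otherwise fall back to exponential backoff (base * 2**attempt).
    """
    if status != 429:
        return None
    retry_after = headers.get("Retry-After", "")
    if retry_after.isdigit():
        return float(retry_after)
    return base * (2 ** attempt)


# One throttle per source: 1 req/sec for MusicBrainz, 60/min for Discogs.
musicbrainz_throttle = MinIntervalThrottle(1.0)
discogs_throttle = MinIntervalThrottle(60 / 60)

print(backoff_delay(429, {"Retry-After": "5"}, attempt=0))
```

A caller would invoke `wait()` immediately before each request and consult `backoff_delay` on each response before retrying.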
streaming
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Apple Music | API | via API | fair use | idle | MusicKit API — requires Apple developer token. |
| Bandcamp | — | manual / per-site | n/a | idle | No general public API; link-only. |
| Last.fm | API | via API | fair use | idle | Needs API key. |
| SoundCloud | API | via API | varies | idle | API registration is gated; link-only in V1. |
| Spotify | API | via API | rolling 30s window | idle | Client-credentials flow. No monthly-listener field in API (use followers/popularity). |
| YouTube | API | via API | quota units/day | idle | YouTube Data API v3. |
| YouTube Music | — | manual / per-site | n/a | idle | No official API; reachable via YouTube Data API for some data. |
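
For the Spotify row, the client-credentials flow amounts to a single POST to Spotify's token endpoint with the client ID and secret sent as HTTP Basic credentials. The sketch below only builds that request and does not send it. The function name and the placeholder credentials are assumptions for illustration.

```python
import base64
import urllib.parse
import urllib.request

TOKEN_URL = "https://accounts.spotify.com/api/token"


def build_token_request(client_id: str, client_secret: str) -> urllib.request.Request:
    """Build (but do not send) a client-credentials token request."""
    credentials = f"{client_id}:{client_secret}".encode("utf-8")
    basic = base64.b64encode(credentials).decode("ascii")
    body = urllib.parse.urlencode({"grant_type": "client_credentials"}).encode("ascii")
    return urllib.request.Request(
        TOKEN_URL,
        data=body,
        headers={
            "Authorization": f"Basic {basic}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


# Placeholder credentials; real values come from the Spotify developer dashboard.
request = build_token_request("my-client-id", "my-client-secret")
print(request.get_method(), request.full_url)
```

Sending the request with `urllib.request.urlopen(request)` should return a JSON body containing an `access_token`, which is then passed as a Bearer token on later API calls. This flow covers public catalog data only, not user-scoped endpoints.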
charts
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Beatport Charts | — | manual / per-site | n/a | idle | Discovery seed for top tracks/artists. |
| Shazam Charts | — | manual / per-site | n/a | idle | Trend signal; limited access. |
| Traxsource Charts | — | manual / per-site | n/a | idle | Discovery seed for soulful/deep/Afro house. |
playlists
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Apple Music Playlists | API | via API | fair use | idle | Discovery via Apple editorial playlists (MusicKit). |
| Spotify Playlists | API | via API | rolling window | idle | Discovery via editorial house playlists (Spotify API). |
events
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Bandsintown | API | via API | fair use | idle | Needs app_id. |
| Dice | — | manual / per-site | n/a | idle | No public API; public event pages only where permitted. |
| EDM Train | API | via API | fair use | idle | API available with key. |
| Eventbrite | API | via API | fair use | idle | Official API with OAuth token. |
| Facebook event pages | — | manual / per-site | n/a | idle | Public event info only; no login/scraping of private data. |
| Festival websites | — | manual / per-site | per-site | idle | Category: lineup pages of major festivals; per-site robots.txt + ToS. |
| Promoter websites | — | manual / per-site | per-site | idle | Category: promoter event listings; per-site compliance. |
| Resident Advisor | — | manual / per-site | n/a | idle | GraphQL endpoint exists but is unofficial; respect robots.txt and ToS. |
| Songkick | API | via API | fair use | idle | Needs API key (request access). |
| Venue calendars | — | manual / per-site | per-site | idle | Category: club/venue event calendars; per-site compliance. |
label
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Afro House labels | — | manual / per-site | per-site | idle | Category: Afro House labels (e.g. MoBlack, Get Physical AfroBros, Offering Recordings). |
| Anjunadeep | — | manual / per-site | per-site | idle | Roster + demo policy. |
| Defected Records | — | manual / per-site | per-site | idle | Roster + release pages. |
| Glitterbox | — | manual / per-site | per-site | idle | Defected sub-brand. |
| Kaoz Theory | — | manual / per-site | per-site | idle | Kerri Chandler label. |
| Nervous Records | — | manual / per-site | per-site | idle | NYC house catalog. |
| Spinnin' Deep | — | manual / per-site | per-site | idle | Spinnin' Records sub-label. |
| Strictly Rhythm | — | manual / per-site | per-site | idle | NYC house catalog. |
| Toolroom Records | — | manual / per-site | per-site | idle | Roster + demo policy. |
press
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Boiler Room | — | manual / per-site | n/a | idle | Set/interview links; manual or via YouTube. |
| DJ Mag | — | manual / per-site | n/a | idle | Press mentions / Top 100; manual citation. |
| Mixmag | — | manual / per-site | n/a | idle | Press mentions; manual citation. |
| Wikipedia | API | ✅ allowed | fair use | ok · June 22, 2026 | MediaWiki API. |
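
One possible use of the MediaWiki API here is to look up the Wikidata item behind a Wikipedia article, which ties press and reference pages back to the cross-IDs held in Wikidata. The sketch below only constructs the query URL. The helper name `wikibase_item_url` and the choice of the English-language endpoint are assumptions for this example.

```python
from urllib.parse import urlencode

# Assumes the English-language Wikipedia; other languages use their own host.
API_URL = "https://en.wikipedia.org/w/api.php"


def wikibase_item_url(title: str) -> str:
    """Return a MediaWiki API URL asking for the Wikidata item ID of a page."""
    params = {
        "action": "query",
        "format": "json",
        "prop": "pageprops",
        "ppprop": "wikibase_item",
        "redirects": 1,
        "titles": title,
    }
    return f"{API_URL}?{urlencode(params)}"


print(wikibase_item_url("Kerri Chandler"))
```

The JSON response should list the page's `wikibase_item` under `pageprops` when the article is linked to Wikidata. Requests should carry a descriptive User-Agent and stay within fair-use limits.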
tracklists
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| 1001Tracklists | — | manual / per-site | n/a | idle | No public API and strict ToS; use only with permission / manual. |
reference
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| AllMusic | — | manual / per-site | n/a | idle | No public API; reference only. |
social
links
community
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Reddit communities | API | via API | per Reddit API terms | idle | r/house, r/DeepHouse etc. via official API. |
web
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Artist websites | — | manual / per-site | per-site | idle | Category: official artist sites — booking/management/contact links. |
| Booking agency websites | — | manual / per-site | per-site | idle | Category: agency rosters + booking contacts. |
| Label websites | — | manual / per-site | per-site | idle | Category: official label sites — roster/demo/contact. |
search
| Source | API | Automated access | Rate limit | Status | Notes |
|---|---|---|---|---|---|
| Google Search | API | via API | quota/day | idle | Programmable Search Engine or SerpApi; uses advanced search operators for discovery. |