Draft:Job Data Pool (jobdatapool.com)


JobDataPool is an open job-listings data infrastructure project within the Job Pool ecosystem. It provides a public REST API, an OpenAPI contract, JSON Schemas, and versioned datasets for structured job listings.[1] The project's documentation describes JobDataPool as the canonical data layer of an ecosystem that divides consumer job search, transparency tooling, canonical data infrastructure, and ingestion operations among separate domains.[2]

Overview

JobDataPool is designed to expose job listings as structured data rather than only as consumer search pages. The public documentation describes the project as hosting API endpoints, machine-readable contracts, dataset releases, and architecture RFCs for downstream products, transparency surfaces, and contributor tooling.[1] The project uses the term "Open Job Data Pool" for a shared, continuously refreshed, multi-source data layer for job listings.[3]

Architecture

The Job Pool documentation defines four related surfaces with different responsibilities. In this topology, mewannajob.com is the consumer job-search product, jobpool.live is the transparency and power-user layer, jobdatapool.com is the canonical data infrastructure layer, and datapool.work is the ingestion and contributor operations layer.[2]

JobDataPool's role in that architecture is to publish stable data contracts and access paths. The RFC describes the domain as responsible for the public API, bulk dataset distribution, schema definitions, versioned data access, and developer documentation.[2]

Data and API

The JobDataPool v1 API is documented through an OpenAPI 3.1 specification. The API includes endpoints for jobs, source metadata, health checks, and launch metrics.[4] The public site describes the v1 API as requiring no authentication and as rate-limited, with documented limits on the number of requests and on the rows returned per call.[1]
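As an illustration of how a client might respect a per-call row limit, the following Python sketch builds a request URL for a page of job listings. The base path, the `limit` and `offset` parameter names, and the cap of 100 rows are all assumptions made for this example; the actual values are defined by the project's OpenAPI specification.

```python
from urllib.parse import urlencode

# Hypothetical values: the real base path, parameter names, and row cap
# are defined by the project's OpenAPI specification, not by this sketch.
API_BASE = "https://jobdatapool.com/api/v1"
MAX_ROWS_PER_CALL = 100


def build_jobs_url(limit: int = 50, offset: int = 0) -> str:
    """Return a URL for one page of listings, clamping limit to the assumed cap."""
    if offset < 0:
        raise ValueError("offset must be non-negative")
    # Keep the requested row count between 1 and the assumed maximum.
    rows = max(1, min(limit, MAX_ROWS_PER_CALL))
    query = urlencode({"limit": rows, "offset": offset})
    return f"{API_BASE}/jobs?{query}"
```

Clamping on the client side avoids sending requests that a rate-limited server would be expected to reject or truncate.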

The project also publishes JSON Schemas for its data contracts. Its job listing schema includes fields such as listing ID, ingestion date, title, company, location, employment type, industry fields, compensation text, source URL, application link, ingest timestamps, validation date, closed-listing status, and source business URL.[5]
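A consumer of the schema might model a listing roughly as follows. This Python sketch covers only a subset of the fields listed above; the attribute names, types, and the choice of which fields are required are assumptions for illustration and are not taken from the published JSON Schema.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class JobListing:
    # Hypothetical attribute names; the published schema defines the real ones.
    listing_id: str
    title: str
    company: str
    location: Optional[str] = None
    employment_type: Optional[str] = None
    compensation_text: Optional[str] = None
    source_url: Optional[str] = None
    application_link: Optional[str] = None
    is_closed: bool = False


def from_record(record: dict) -> JobListing:
    """Build a JobListing from a dict, ignoring keys this sketch does not model."""
    known = {f.name for f in fields(JobListing)}
    return JobListing(**{k: v for k, v in record.items() if k in known})
```

Ignoring unknown keys lets the sketch tolerate fields it does not model, although a stricter consumer would validate each record against the published schema instead.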

Dataset releases

JobDataPool publishes reviewed job listings as monthly CSV snapshots. Its dataset page states that the releases are pinned by month and backed by DVC pointers, with related JSON, Parquet, API JSON, and gzip artifacts derived from the same release path.[6] The public DVC repository for the dataset pointers is hosted on GitHub under jobpool-live/jobpool-listings-r2.[7]

As of June 2026, the dataset pointer repository included monthly DVC pointer files for April, May, and June 2026. The June 2026 pointer referenced a CSV file on Cloudflare R2 and included checksum, size, path, and changed-date metadata.[8]
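Checksum and size metadata of this kind can be used to check a downloaded snapshot. The Python sketch below assumes that the pointer's checksum is an MD5 hex digest, which is DVC's usual default but is not stated in the sources above. The function name and its parameters are hypothetical, and the expected values would be copied by hand from the pointer file.

```python
import hashlib
import os


def verify_snapshot(path: str, expected_md5: str, expected_size: int) -> bool:
    """Return True if the file at path matches the expected size and MD5 digest."""
    # Cheap check first: a size mismatch means the digest cannot match.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so large CSV snapshots are not loaded at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_md5.lower()
```

Reading the file in chunks keeps memory use constant regardless of snapshot size.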

The related jobpool.live site describes itself as the transparency layer for the Job Pool ecosystem, with bulk downloads, scraper documentation, contributor leaderboards, staged submissions, and operational visibility.[9] JobDataPool documentation links this transparency layer back to the canonical API, schema, dataset, and RFC materials on jobdatapool.com.[9]

References

  1. "JobDataPool - Open Job Data API, JSON Schemas & Versioned Datasets". JobDataPool. Retrieved 3 June 2026.
  2. "JPE-RFC-0002 - Job Pool Web Topology". JobDataPool. Retrieved 3 June 2026.
  3. "JPE-RFC-0001 - Open Job Data Pool". JobDataPool. Retrieved 3 June 2026.
  4. "JobDataPool OpenAPI specification". JobDataPool. Retrieved 3 June 2026.
  5. "JobDataPool job listing JSON Schema". JobDataPool. Retrieved 3 June 2026.
  6. "Download Open Job Listings CSV Datasets (Versioned)". JobDataPool. Retrieved 3 June 2026.
  7. "jobpool-live/jobpool-listings-r2". GitHub. Retrieved 3 June 2026.
  8. "listings-june-2026.csv.dvc". GitHub. Retrieved 3 June 2026.
  9. "jobpool.live - Live Job Data Transparency Layer". jobpool.live. Retrieved 3 June 2026.
