Epoch AI logo

Expression of Interest: Data Lead

Epoch AI
Full-time
Remote friendly (Remote Pennsylvania US)
Worldwide
$125,000 - $200,000 USD yearly

Expression of Interest: Data Lead

Department: Epoch AI

Employment Type: Full Time

Location: Remote

Compensation: $125,000 - $200,000 / year



Description

Epoch AI is looking for a Data Lead to lead our data collection and management. You’ll own our datasets and research data collection, allowing us to inform the world about the trajectory of advanced AI. 


About the role and team

Note: We are characterizing this opening as an expression of interest because we expect there are multiple backgrounds and skillsets that could be successful in this role. We’re open to working with promising candidates to clarify the scope and responsibilities of the role in a way that would play to their unique strengths and interests.
 

Please do not include a cover letter, photograph, or headshot of yourself, or any personal information that is not relevant to the role. Applications are rolling.

Epoch AI is a leading research institute investigating trends in artificial intelligence, aiming to provide rigorous, accessible insights into AI development. A core part of our work is collecting data on key topics such as AI models and their training details, AI hardware, datacenters, AI lab resources, and more.

We are seeking a proactive Data Lead to take ownership of our data processes, supporting Epoch’s data needs from the beginnings of research all the way through to maintaining high-quality datasets to drive the visualizations on our website. 

This role is fully remote and we expect to be legally able to hire in many countries. If you are unsure whether we can hire in the country you are based in, please email [email protected]. This role is open to full-time candidates.

  • Lead systematic collection of new datasets and new data sources for our research. You would proactively work with Epoch researchers to establish production-ready schemas and search processes for new datasets, then lead the data collection process to happen on time and to a high standard.
  • Own the datasets underpinning our products, ensuring their ongoing accuracy, comprehensiveness, and relevance. You will set the roadmaps for our key datasets, balancing priorities across research needs, presentation on the website, and tractability. For example, how do we keep track of which AI models were trained with large-scale compute, as the definition of training compute evolves? How do we ensure our database receives new entries soon after their release? How do we audit new entries for accuracy?
  • Document our data processes, make them systematic, and ensure they remain valid. What AI models are included in our database? How do we find these models to add them? How can we say what level of coverage we are achieving? You'll transform ad-hoc processes into repeatable, documented systems that scale with minimal supervision.
  • Manage data contractors to efficiently collect data that requires human judgement. We often hire contractors to scale up data collection. You would be in charge of prioritizing contractor work, recruiting and managing contractors, and ensuring their work supports our projects. 
  • Use AI tools to accelerate our data pipeline, while preserving accuracy and trustworthiness. We are already automating significant parts of data collection through LLM-powered filtering of search results. How can we do more, in a way that we can trust?


What we are looking for

Requirements:
  • Systematic and detail-oriented. You are passionate about making systems that are scalable, high-quality, and robust. You never let the same mistake happen twice.
  • Ownership and proactivity. You take responsibility for ambitious projects, and figure out what needs to happen so they succeed. Then you make it happen.
  • Project and product management experience, ideally in data. You have successfully led projects that link challenging technical data to high value products or outputs. You might have experience as a Data Scientist, Project/Product Manager, a Data Engineer, or something else altogether. You likely have 5+ years of professional experience in relevant roles, although we’re open to considering more junior candidates.
  • Technical excellence and a focus on quality. You are an excellent technologist, able to rapidly get up to speed with new software, ideas, or requirements. You don’t need AI research or software engineering experience, but you would be able to productively work with experts to achieve your goals.
If you don’t tick all these boxes but think you would be a great fit, please apply anyway!


What we offer

  • Annual salary between $125,000 and $200,000 USD. 
  • Salaries are not restricted to USD, and contracts and payments are usually in local currencies. Conversions are based on one-year average exchange rates.
  • Fully remote environment, including flexible work hours and schedules.
  • For this role, we can support relocation and sponsor visas in the US and the UK. As a 501(c)(3) research nonprofit, we are H-1B cap-exempt.
  • Competitive global benefits program, including:
    • Comprehensive health insurance program, including supplemental benefits specific to a local country, as available and mandated by local law.
    • Life insurance and pension plan, if applicable in your country.
  • Generous paid time off (PTO) leave, including:
    • 30 days off per year, with no specific limit on paid time off per year. 
    • Unlimited personal and sick leave
    • Parental leave—up to 6 months of a combination of paid and unpaid parental leave during the first 2 years after child's birth or adoption.
  • A flexible and generous expense policy for you to spend on equipment and a large range of productivity tools or learning/development opportunities you might find valuable.
  • Paid work trips, including 3 staff retreats per year and relevant conferences.
  • Access to our very well-equipped offices in Berkeley, California, including paid meals, snacks, gym, and more.
  • Other benefits as allowed at the discretion of Epoch AI’s leadership and local availability.
  • A supportive and energized team that is excited about the work we are doing and our mission.

  • Please email [email protected] if you have any questions about this role or accessibility requests. 
  • While we welcome applicants from all time zones, we prefer candidates who can maintain overlap with both UTC (GMT) and UTC–8 (Pacific Time) time zones.
  • Please submit all of your application materials in English and note that we require professional-level English proficiency.
  • We prefer candidates who can travel: we hold three retreats per year to which attendance is strongly encouraged.
  • Epoch is committed to building an inclusive, equitable, and supportive community for you to thrive and do your best work. We’re committed to finding the best people for our team, so please don’t hesitate to apply for a role regardless of your age, gender identity/expression, political identity, personal preferences, physical abilities, veteran status, neurodiversity or any other background.