ABOUT KHAN ACADEMY
Khan Academy is a fast-paced, nonprofit startup on a mission to provide a free, world-class education for anyone, anywhere. We already reach millions of students every month and are growing rapidly. We’re building a library of world-class instructional and practice resources that empowers learners. Whether they’re studying matrices, mitosis, or multivariable calculus, we want to offer students the resources to realize that they can learn anything.
ABOUT KHAN ACADEMY INDIA
Khan Academy India aims to deliver a world-class user experience that is locally relevant to learners in India and is enabled by a strong on-the-ground team and operations. Our learning system is mastery-based, which allows students to master key concepts at a pace that is right for them before moving on to more challenging content. Having served under 500,000 learners in 2016, we now serve almost 4 million learners a month across our websites, apps, and YouTube channels. These learners include both independent learners accessing us at home and teacher-directed learners in schools. Our focus is to reach the underserved by making our content accessible in local languages and by working with large public school systems. Khan Academy is available in Hinglish, Hindi, Gujarati, Assamese, Marathi, Punjabi, and Kannada.
ABOUT THE ROLE
We are looking for a data engineer who will help us build and maintain our whole data stack. From data extraction to data ingestion, and from data warehouse optimisation to data access and its end usage, you will see it all. The primary focus will be on choosing the optimal tools for these purposes and then implementing, maintaining, and monitoring them. Ideally, you are already a full-stack data person with strong technical skills (perfect knowledge of at least one dynamic language and SQL), a good business mindset, and data analysis competencies.
In this role you will:
- Implement a solid ETL process and manage the data warehouse; set up real-time and batch data pipelines; monitor performance and advise on any necessary infrastructure changes
- Work with our engineering teams to improve the tools and datasets and work self-sufficiently with data pipelines (e.g. ETLs) on an as-needed basis
- Own the continued development of metrics & KPIs, including trend analyses, metrics research, self-service tooling development (e.g. Looker or Tableau), and support of business teams who use them to drive strategic & operational decisions
- Conceptualize and execute deep-dive analyses to uncover insights about our users across their lifecycle (from acquisition to usage & retention). Perform advanced exploratory analyses on large data sets to extract insights about teacher & learner behavior and guide our decisions on different initiatives. Mine for patterns and causal relationships, and paint a picture of how users interact with our products to achieve learning outcomes
- Contribute to the design of hypothesis-driven experimentation, such as outcome measurement for campaigns and other optimization initiatives, and use your expertise in experimentation (e.g. A/B testing, causal inference) to measure the impact of various programs and interventions
ABOUT YOU
You are someone with:
- A willingness to roll up your sleeves and help the team get work done as we are growing
- 4+ years of hands-on experience in the data engineering and analytics field, ideally in an education setting
- Knowledge of advanced statistical techniques (e.g. multiple regression, hypothesis testing) and machine learning techniques (e.g. clustering, decision tree learning) for real-world applications
- Strong SQL foundations & ability to manipulate data using R or Python
- Prior experience with the end-to-end analytics chain (e.g. data modeling & ETL, BI tool development) is nice to have, as is a good understanding of cloud data warehouse management systems (AWS/GCP/Azure)
- Strong verbal/written communication & data presentation skills, including an ability to communicate effectively with both business and technical teams; experience with BI tools is a plus
- Ability to work collaboratively with cross-functional teams (product, content, marketing, philanthropy, and analytics) whose staff span a wide range of time zones (Delhi, India to California, USA) to research and improve our content and products
- Hands-on experience with scripting languages on the back end (Python, Ruby, Node.js, etc.) and JavaScript on the front end
- Awareness of good practices when collaborating with version control (Git, Mercurial)
- Knowledge of dbt, Apache Airflow, and Docker is preferred
PERKS AND BENEFITS
We may be a nonprofit, but we reward our talented team like a for-profit.
- Competitive salaries and a meritocracy-driven, candid culture
- A fun, high-caliber team that trusts you and gives you the freedom to be brilliant
- The ability to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
- Remote-work friendly, with the option to work from home and flexible schedules
HOW TO APPLY
- Attach your resume or LinkedIn URL in the space provided below.
- Complete the pre-work assignment here and submit your assignments below.
- Please submit a Google Drive link to your assignment.
- Make sure you have enabled view access for anyone with the link.
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, or veteran status.