Job Description
Paperpile is seeking a Backend Engineer to work on the data-heavy systems behind a literature database of over 250 million academic papers. In this role, you will build and optimize data ingestion pipelines, improve search, manage PDF processing, and develop clean APIs.
**Requirements:**
- Strong backend engineering background with experience in production environments.
- Proficient in deploying and operating services on AWS.
- Experience designing and maintaining data ingestion pipelines from diverse sources.
- Familiarity with web scraping and third-party data sources/APIs.
- Proficient in Node.js and TypeScript (Java or Python experience is acceptable).
- High standards for data quality, including correctness and consistency.
- Solid understanding of full-text search systems, including indexing strategy and query optimization.
- Proficient in building reliable REST APIs.
**Preferred Experience:**
- Familiarity with academic publishing formats (e.g., PubMed, Crossref, arXiv).
- Experience with PDF processing pipelines at scale.
- Knowledge of LLM-based document processing or ML pipelines for extracting structured data.
- Experience with large-scale web crawling and scraping.
**Compensation:**
- Base salary: €60,000 – €90,000, depending on experience.
- Bonus/equity program available.
To apply, please mention the word **NOURISH** and tag RMzUuMTcwLjgyLjMw to confirm you've read the job post thoroughly.