Data Engineer
Overview
We-'re building Affine to solve one of
- ' biggest pain points: legal research. Lawyers spend countless hours searching through scattered sources — databases, cloud services, local files — using outdated search tools, and then manually compiling and synthesizing everything into client deliverables.
Affine uses recent advances in generative AI to unify these fragmented sources and enable search that actually understands what lawyers need. From initial query to analysis to final documents, our platform supports the entire research workflow.
You-'ll be joining a
- seed stage
- up with a team of 11 that combines expertise in AI, data science, software engineering, and law. We are already working with leading law firms and organizations across Portugal to transform how legal research is done.
What will you do
What will you do:
You-'ll architect and own the data infrastructure that powers Affine-'s legal research platform. You-'ll build systems that lawyers rely on daily for critical research, ensuring data is always current and accurate.
You will
- Build custom API integration with law firms document management systems (i
Manage, Share
Point, Net
Documents), mirroring complex permissions structures and syncing in
- time with change detection - Integrate new legal datasources, collaborating with legal experts to understand requirements and build custom scrapers and pipelines
- Design and implement monitoring systems to track data freshness, quality, and pipeline health across both public legal sources and private client document repositories
- Scale and manage our pipeline as we add new data sources, scale ingestion volume, and support more concurrent users
Who you are
You are someone with experience building production pipelines and working with complex data sources, while maintaining high standards for quality and reliability.
You understand that legal professionals depend on accurate, timely data for critical decisions, and you take that responsibility seriously. At the same time, you know that building at a startup means shipping fast, making pragmatic tradeoffs, and sometimes choosing the solution that works today over the perfect architecture that might work tomorrow.
Qualifications
- Built and scaled production data pipelines from scratch, handling everything from scraping and ETL to storage and monitoring
- Strong Python backend development experience with a focus on writing clean and maintainable code
- Proven track record deploying and managing services on cloud platforms (Azure, AWS) in production environments
- Hands-on experience with orchestration tools (Dagster, Airflow, Modal, or Prefect)
- Production experience with database technologies (Postgre
SQL, Vespa, Mongo
DB or equivalent) - Solid experience with infrastructure technologies (Kubernetes, Docker or equivalent)
Nice-to-haves
- Experience integrating enterprise document management systems (i
Manage, Share
Point, etc. ) with complex permission handling into data pipelines - Knowledge of building AI-powered data pipelines (embeddings, vector search, LLM preprocessing)
- Background working in regulated industries where data accuracy and audit trails
- 't optional
What we offer
- Competitive salary, according to experience: €30-45k gross/year
- Stock options
- 22 days of paid vacation + your birthday off
- Other perks such as flexible working location and schedule, health insurance and equipment budget
- Opportunity to shape and own the entire data architecture at a
- moving
- stage startup
Seniority level
- Mid-Senior level
Employment type
- Full-time
Job function
- Information Technology
Industries
- Software Development
- Informações detalhadas sobre a oferta de emprego
Empresa: Affine by NeuralShift Localização: Lisboa
Lisboa, Lisboa, PortugalPublicado: 28. 9. 2025
Vaga de emprego atual
Seja o primeiro a candidar-se à vaga de emprego oferecida!