By continuing to use our site, you consent to the processing of cookies, user data (location information, type and version of the OS, the type and version of the browser, the type of device and the resolution of its screen, the source of where the user came from, from which site or for what advertisement, language OS and Browser, which pages are opened and to which buttons the user presses, ip-address) for the purpose of site functioning, retargeting and statistical surveys and reviews. If you do not want your data to be processed, please leave the site.

Delivering on the Promise of Synthetic Data

Replica Analytics Careers

Data Scientist

Replica Analytics is recruiting for data scientists to join our fast-growing startup. Working as part of the data science team, this role involves coordinating with external partners and clients on data synthesis projects as well as researching and implementing improvements to existing data synthesis pipelines. There are multiple roles for senior and junior data scientists. 

The work involves the use and improvement of statistical machine learning methods and deep learning methods for synthetic data generation problems. This includes working with simple tabular datasets as well as complex longitudinal and high-dimensional data. 

There will be lots of experimentation and the development of novel utility and privacy metrics to evaluate synthetic data. 

Working at Replica Analytics is a unique opportunity to:

  • Advance health care, AI and privacy at the same time – three of the hottest topics in the world today
  • Enjoy a highly interdisciplinary, innovative and supportive environment to develop valuable skills
  • Join a trusted leader in the space and a promising start-up with tremendous potential in this rapidly growing market
  • Receive competitive compensation and opportunities for rapid professional development and growth

Key Responsibilities

  • Client Projects

    • Maintain and improve existing production and quality control pipelines for synthetic data deliverables
    • Communicate and coordinate with clients on data synthesis deliveries
    • Participate in client education on data synthesis technologies

  • Research & Development
    • Contribute to the development of new technologies for data synthesis using a wide variety of machine learning methods; investigate various research topics in machine learning and statistics to determine the best method for data synthesis
    • Contribute to the implementation and testing of production and research pipelines in Python and R as well as other languages
    • Contribute to the dissemination of research results in the form of peer-reviewed papers, reports, and presentations

Minimal Requirements

  • BSc/MSc/PhD degree (or equivalent) in mathematics, statistics, computer science, or electrical engineering

  • Work experience: 1 year for candidates with a PhD / 2 years for candidates with an MSc  / 3 years for candidates with a BSc

  • Demonstrated ability for conducting statistical and machine learning research (in the form of a thesis, publications, or side projects) and to independently solve problems

  • Proficient in Python or R programming for data science (data cleaning/pre-processing, classification and regression, model evaluation, data visualization, writing and applying custom functions, parallelization)

  • Deep learning experience with PyTorch or TensorFlow would be a big plus

  • Excellent organizational and communication skills (verbal and oral)

  • Detail-oriented

  • Motivated to learn and apply new machine learning methods to solve real-life problems

Optional Requirements

  • Experience working with health care data
  • Knowledge of SAS and SAS programming would be a plus

About Replica Analytics

Replica Analytics develops software for generating synthetic data that maintains the statistical properties of real data. We enable easy, fast and effective access to high utility data that is made portable through data simulators.

Careers Contact:

If you are interested in this position with Replica Analytics, please send an email to with your resume and contact information, and we will follow-up with you.