At NewtonX we are building the World’s Best Knowledge Access Platform. We connect our clients (mostly Fortune 500 companies) with subject-matter experts who possess highly specific, hard-to-find knowledge across all industries. Our core technology consists of a knowledge-graph-based expert search engine, a global knowledge marketplace, an automated expert outreach platform, and a full-spectrum client support system.
Developing and enhancing these key systems and platforms requires strong data engineering expertise to solve a wide variety of exciting data problems. We are seeking seasoned and passionate data engineers who want to join a small but fast-growing team (fewer than 40 people), get in early, and influence our data technologies.
Build reliable data pipelines to clean, aggregate, and transform large volumes of data from multiple sources.
Develop versatile software components to extract useful information from various unstructured or semi-structured text data.
Implement advanced search functionalities and improve the efficiency of search indexing.
Work closely with data scientists to develop, test, and iterate on data models and algorithms.
Contribute to company-wide data privacy compliance efforts.
Minimum 6 years of experience in the field.
Extensive experience in building large-scale data pipelines with a mainstream big data stack.
Strong expertise in extracting useful information from unstructured and semi-structured text data.
Strong software development skills; high proficiency in Java is a plus.
Professional working experience with Elasticsearch, Apache Beam, Spark, and GCP Dataflow is a big plus.
Strong expertise in NLP or Text Mining is also a big plus.
Bachelor's degree or higher in a relevant field of study.