The AI Data Team in Amazon Web Services (AWS) is looking for a Data Scientist with a passion for developing innovative methods to maximize the power of natural language data. This position is an opportunity to apply your expertise in a challenging but supportive environment. The position may be remote, with a preference for Santa Clara, Seattle, or New York City.
The mission of the AI Data Team is to engineer the datasets critical to the success of AWS’s machine learning services. From chatbots to subtitles to search results and beyond, these products support dozens of languages and impact millions of people every day. The team is a group of language engineers, linguists, data scientists, data engineers, and program managers who partner closely with the science, engineering, and product teams. The AI Data Team is customer-obsessed and committed to delivering results with the highest quality and integrity.
As a Data Scientist, you will start by learning the full context of critical projects for Comprehend natural language processing (NLP) services, consulting with stakeholders in science, engineering, and product teams. You will determine the appropriate metrics for data analysis and quality checks on data annotations, to ensure that the data is optimized for developing models that exceed customer expectations. You will consider domain gaps, data bias, and data noise in your analysis. You will design and write Python packages for these processes, and work with data engineering to implement them in data pipelines built to scale.
You will also work with language engineers to understand the challenges in producing or acquiring data, and in generating high-quality labels. You will stay up to date on developments in the field of data-centric AI, and experiment with new techniques for data cleaning, labeling, and augmentation. You will share your results in written documents and presentations to the data team and stakeholders. You will scale these methods for ongoing data collection and annotation, collaborating with data engineering as necessary.
You will then contribute to large data team initiatives to design and build technical assets, such as data warehousing and analytics tooling, that support multiple programs. You will gain an understanding of data collection methods across the data team, and write code that provides metrics and insights across datasets and ensures the data is discoverable and reusable.
Key job responsibilities
- Design and write Python packages for analyzing natural language datasets, including domain gaps, data bias, and data noise
- Develop innovative techniques for data cleaning, labeling, and augmentation, and scale these methods for ongoing data collection and annotation
- Contribute to large initiatives for data warehousing and analytics tooling, writing code that provides metrics and insights across datasets and ensures the data is discoverable and reusable
Basic Qualifications
- Degree in a quantitative field such as computer science, mathematics, or computational linguistics
- 3+ years of hands-on experience as a data scientist in an industry setting, including statistical modeling and data visualization
- Proficiency in Python
- Experience with natural language data and NLP techniques
- Experience working with stakeholders such as product and engineering teams
Preferred Qualifications
- Experience in developing and evaluating data annotation and data quality metrics
- Experience using AWS tools and services
- Strong written and verbal communication skills, with an ability to present complex technical information in a clear and concise manner to a variety of audiences
The pay range for this position in Colorado is $136,000 – $184,000 (yr.); however, base pay offered may vary depending on job-related knowledge, skills, and experience. A sign-on bonus and restricted stock units may be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits, dependent on the position offered. This information is provided per the Colorado Equal Pay Act. Base pay information is based on market location.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
More Information
- Salary Offer 0 ~ $3000
- Experience Level Junior
- Total Years Experience 0-5