HS Data Science Intern
Remote - Chicago IL, California, Florida, Illinois, Indiana, Maryland, Massachusetts, New Jersey, New York, Ohio, South Carolina, Virginia, Wisconsin, Washington D.C.
The American Medical Association (AMA) is the nation's largest professional Association of physicians and a non-profit organization. We are a unifying voice and powerful ally for America's physicians, the patients they care for, and the promise of a healthier nation. To be part of the AMA is to be part of our Mission to promote the art and science of medicine and the betterment of public health.
We continuously work to embed equity in our internal practices and are committed to increasing the diversity of our staff across all levels of the organization. We intentionally work to create the right conditions to enable our employees to feel that they can be their authentic selves and fully participate in the life of the enterprise.
We encourage and support professional development for our employees, and we are dedicated to social responsibility. We invite you to learn more about us and we look forward to getting to know you.
We have an opportunity for a remote HS Data Science Intern on our Health Solution team. As a HS Data Science Intern, you will support the expansion of the Health Solutions Data Science function through the analysis of existing AMA data assets as well as potential data acquisitions opportunities that would provide benefit to the AMA. You will conduct analyses to enhance the collection, enrichment, and management of AMA's data, to optimize current usage and enable new business uses of these data assets. In your role, you will develop consistent and timely analysis of metrics and key performance indicators, scorecards and trending reports that lead to informed business decisions and reflect the true state of AMA data assets. You will also assist in the implementation of the long-range AMA Physician Masterfile strategy and the overall data management process modernization effort, and assist in development of innovative Data Science approaches to enhance analytical capabilities of the Health Solutions Group.
RESPONSIBILITIES:
May include other responsibilities as assigned
REQUIREMENTS:
1. Be working towards a BS or MS degree in Data Science, Statistics, Analytics, Computer Science, Information Systems, or a related degree.
2. Basic analysis skills; familiar with data analysis tools and techniques, such as SAS, R, Python, SPSS, text analytics, NLP; Able to manage and integrate insights and establish monitoring around multiple internal data sources, such as AMA Masterfile, Enterprise Data Warehouse, customer database and purchased/appended data if available.
3. Ingestion, standardization, metadata management, business rule curation, data enhancement, and statistical computation against data sources that include relational, XML, JSON, streaming, REST API, and unstructured data.
4. Understanding of orchestration and scheduling tooling such as Jenkins/Airflow/Rundeck
5. Experience and interest in presenting analytic findings to business customers. Familiar with reporting and visualization tools, such as Tableau, Power BI, Business Objects, etc.
6. Familiar with SQL in the extraction and manipulation of datasets. General knowledge of transactional data processing, ETL, data warehouse, data mart, and operational reporting solutions a plus. Any experience with the following tools is desirable: Aqua Data Studio, IBM DataStage / QualityStage, Information Analyzer, Informatica Business Glossary and Powercenter.
7. Any experience with or basic understanding of newer database structures and models such as NoSQL, Hadoop, Marklogic and Cassandra, a plus. Programming skills in Python, Java highly desirable.
8. Experience and interest in deeper analysis of data subjects, potentially spanning over a timeline of several months.
9. Experience with various batch matching methodologies. Willingness to learn and work with disparate data sets of varying structures and quality and interested in creating ad hoc methodologies to facilitate matching on these data sets, often without the benefit of common keys.
10. Any knowledge/interest in implementing data management systems, ETL development, or master data management solutions is highly desirable.
What Puts You Over The Top * Some practical experience working on AWS or another cloud provider * Some practical experience developing with Apache Spark and/or Hive * Good knowledge of SQL and experience with columnar datastores * You are working on your Masters degree
The pay range for this position in Chicago IL, California, Florida, Illinois, Indiana, Maryland, Massachusetts, New Jersey, New York, Ohio, South Carolina, Virginia, Wisconsin, or Washington D.C. is $20-23hr. This is the lowest to highest salary we in good faith believe we would pay for this role at the time of this posting. An employee's pay within the salary range will be based on numerous factors including, but not limited to, relevant education, qualifications, experience, skills, geographical location and business or organizational needs.
We are an equal opportunity employer, committed to diversity in our workforce. All qualified applicants will receive consideration for employment. As an EOE/AA employer, the American Medical Association will not discriminate in its employment practices due to an applicant's race, color, religion, sex, age, national origin, sexual orientation, gender identity and veteran or disability status.
THE AMA IS COMMITTED TO IMPROVING THE HEALTH OF THE NATION