Data Engineer [Multiple Positions Available]
J.P. Morgan
DESCRIPTION:
Duties: Perform solution architecture, and design and develop data ingestion processes for Machine Learning pipelines. Evaluate new and current technologies using emerging model feature engineering standards and frameworks. Provide technical guidance and direction to support the business and its technical teams, contractors, and vendors. Contribute to the engineering community as an advocate of firm-wide data frameworks, tools, and practices in the AI and ML Development Life Cycle. Influence peers and project decision-makers to consider the use and application of leading-edge technologies. Apply advanced analytics techniques to identify, analyze, and interpret trends or patterns in complex data sets, enabling superior Machine Learning model outcomes. Innovate new ways of managing, transforming, and validating Machine Learning model outputs. Establish and enforce guidelines to ensure consistency, quality, and completeness of Machine Learning feature data assets. Act as a coach and mentor to team members on their assigned project tasks. Develop a cohesive MLOps and DataOps pipeline to ensure scalability, reliability, and resiliency. Conduct product work reviews with team members.
QUALIFICATIONS:
Minimum education and experience required: Bachelor's degree in Electronic Engineering, Computer Engineering, Computer Science or related field of study plus 7 years of experience in the job offered or as Data Engineer, IT Project Architect, IT Consultant, Application Developer, Software Engineer, or related occupation.
Skills Required: This position requires seven (7) years of experience with the following: utilizing Data Lake and Delta Lake Management Architecture for AI and ML enablement; designing and implementing data lake management architecture for AI-driven solutions, including both traditional Data Lakes and Delta Lakes for optimized data storage and processing; technology, big data analysis, and ML features domain consulting; analyzing, designing, and conducting proofs of concept (POCs) to validate architectural decisions and data strategies; delivering incremental solutions using an Agile approach, ensuring continuous integration and delivery; implementing transformations on big data platforms, Python, PySpark and Scala programming languages, including NoSQL databases, Teradata, DB2, Hadoop, Snowflake and SAS BI tools with a focus on leveraging Delta Lake for ACID transactions and scalable data processing. This position requires five (5) years of experience with the following: utilizing Databricks and AWS and Azure data processing tools to support ML model training; utilizing data transformation tools including AWS Glue, EMR, EKS, Redshift, MSK (Managed Streaming for Apache Kafka), AWS Kinesis, and Databricks for collaborative data engineering and machine learning workflows; handling terabyte-sized datasets with multi-threading in PySpark on cloud platforms, utilizing Databricks for enhanced performance and scalability; utilizing cloud computing platforms including Azure or AWS, integrating Databricks for seamless data processing and analytics.
This position requires three (3) years of experience with the following: using event-driven architecture (EDA) and real-time streaming to identify fraud proactively; utilizing event-driven architecture using event streaming with Apache Kafka and AWS MSK for real-time feature engineering; developing end-to-end pipelines using Python and PySpark to support Data Lake, Data Warehouse and ML models, leveraging Databricks for model training and deployment. This position requires one (1) year of experience with the following: applying data exploration techniques to analyze customer behavior to find actionable domain-specific insights, utilizing algorithms to explore large collections of customer transactions and reveal hidden relationships among entities, ensuring comprehensive data insights; maintaining governance, reproducibility, and scalability of models, while optimizing workflows for efficiency. This position requires any amount of experience with the following: utilizing AWS Kinesis for real-time data streaming and processing, to ensure low-latency and high-throughput data pipelines.
Job Location: 8181 Communications Parkway, Plano, TX 75024.
Chase is a leading financial services firm, helping nearly half of America’s households and small businesses achieve their financial goals through a broad range of financial products. Our mission is to create engaged, lifelong relationships and put our customers at the heart of everything we do. We also help small businesses, nonprofits and cities grow, delivering solutions to solve all their financial needs.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
Equal Opportunity Employer/Disability/Veterans
Our Consumer & Community Banking division serves our Chase customers through a range of financial services, including personal banking, credit cards, mortgages, auto financing, investment advice, small business loans and payment processing. We’re proud to lead the U.S. in credit card sales and deposit growth and have the most-used digital solutions – all while ranking first in customer satisfaction.