The Data Analytics Intermediate Engineer is a hands-on technical contributor who designs, builds, and maintains scalable data pipelines and infrastructure within large enterprise data environments. This role supports various analytical and operational needs, collaborating with cross-functional teams and applying solid knowledge of big data technologies, programming, and data governance practices. The engineer is expected to work independently on moderately complex tasks and communicate technical concepts clearly to both technical and non-technical stakeholders.
Key Responsibilities:
Design and implement data ingestion, transformation, and cleansing pipelines using PySpark, SQL, and Python/Java.
Work on structured and unstructured datasets stored in HDFS, Hive, Parquet, or cloud-based storage.
Optimize existing data workflows and jobs for performance, scalability, and reliability.
Support batch and streaming data processing frameworks across Big Data platforms (e.g., Hadoop, Spark, Hive, Kafka).
Integrate and process data from multiple sources including APIs, flat files, relational databases, and cloud-native services.
Apply data modeling, partitioning, and file format best practices for efficient storage and querying.
Implement monitoring, logging, and alerting for production pipelines and participate in on-call rotation if required.
Document pipeline logic, data lineage, and schema changes to ensure data transparency and auditability.
Collaborate with data analysts, data scientists, and product owners to translate business needs into scalable data solutions.
Assist in proof-of-concept efforts for new technologies and data integration strategies.
Technical Skills Required:
2–5 years of experience in a data engineering, ETL development, or big data role.
Strong programming experience in Python (or Java) for data manipulation and automation.
Advanced proficiency in SQL (window functions, joins, CTEs, optimization techniques).
Experience working with Apache Spark (PySpark) in a distributed environment.
Hands-on experience with Hadoop ecosystem tools (Hive, HDFS, Oozie, etc.).
Familiarity with Git, Jenkins, Airflow, or other CI/CD and orchestration tools.
Exposure to cloud platforms (AWS Glue/EMR, Azure Data Factory, GCP Dataflow) is a plus.
Knowledge of basic ML workflows (feature engineering, model inputs/outputs) is desirable but not mandatory.
Soft Skills & Communication:
Strong verbal and written communication skills; able to articulate technical concepts to business stakeholders.
Able to document processes, architecture diagrams, and data dictionaries with clarity.
Demonstrates strong interpersonal skills, working well with cross-functional teams in a collaborative Agile/DevOps environment.
Provides informal guidance or mentoring to junior developers and contributes to code reviews and technical discussions.
Proactive in identifying data quality issues, bottlenecks, and process gaps, with a problem-solving mindset.
Education:
Bachelor’s degree in Computer Science, Data Engineering, or a related discipline, or equivalent experience, is required.
------------------------------------------------------
Job Family Group:
Technology
------------------------------------------------------
Job Family:
Data Analytics
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Primary Location:
Irving, Texas, United States
------------------------------------------------------
Primary Location Full Time Salary Range:
$76,230.00 - $106,370.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
------------------------------------------------------
Anticipated Posting Close Date:
May 05, 2025
------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
Citi’s mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. Our core activities are safeguarding assets, lending money, making payments and accessing the capi...