EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.


DESCRIPTION


We are looking for Data Architect any level for data-driven projects. Together we design and drive lots of solutions which generate value from data, taking advantage of scalable platforms, cutting-edge technologies, and machine learning algorithms.

Set of used technologies is very wide, so any technology background of the Data Architect is acceptable. We provide a solid architecture framework, educational programs, and strong SA community to support you in a deep dive to data domain.

Some architectural areas we are focusing on:

• Data Processing Architecture
• Streaming Architecture
• Data Platform Operations
• Metadata Management Architecture
• Cloud Data Services Architecture
• ML and MLOps Architecture
• Data Warehouse Architecture
• Data Management
• Business Intelligence Solutions
• Data Integration Architecture
• Data Security Architecture

Some examples from tool set/technology stack we are using:

• Clouds: AWS, Azure, GCP
• Distributed data processing & ETL Frameworks: Apache Spark (and related cloud specific technologies such as AWS EMR, GCP DataProc, Azure HD Insight), GCP DataFlow, AWS Glue, Databricks
• Distributed Environments: Kubernetes, Docker, AWS ECS, Google Kubernetes Engine, Azure Kubernetes Services
• Analytical Data Warehousing: Snowflake, AWS Redshift, Azure Synapse, GCP BigQuery
• Relational Databases: PostgreSQL, AWS SQL DB, GCP Cloud SQL
• Lightweight/Serverless Compute: AWS Lambda Functions, GCP Cloud Functions, Azure Functions
• No-SQL/Specialized Databases: Cassandra, MongoDB, Azure Cosmos DB, GCP BigTable, Redis (including cloud analogs)
• Data Catalogs & Metadata Management: Collibra, Alation, Informatica,Azure Purview, Google Dataplex
• Integration & flow management: AWS Step Functions, Airflow/ GCP Cloud Composer, Azure Data Factory, Kafka Connect
• Data Streaming: Kafka, AWS Kinesis, GCP Pub/Sub, Azure Event Hub
• Object storages: S3, ADLS, GCS, HDFS, Minio
• Search platforms: Solr, ElasticSearch
• ML: MLflow, Kubeflow, AWS Sagemaker, Azure ML, GCP AI Platform
• Data Visualization: Power BI, Tableau, QlikView, Spotfire, Jupyter
• Platform Operations: IaaC (Terraform, AWS CloudFormation, Azure DevOps etc.), IaM (Azure AD, AWS Cognito, etc.), monitoring (Prometheus, Splunk, Azure Monitor, etc.), CI/CD (Jenkins, GCP Cloud Build, etc.), Cloud cost managment, secuity & networking tools
• Programming Languages: Java, Scala, Python

Is a Remote Job?
No

Since 1993, EPAM Systems has leveraged its advanced software engineering heritage to become the foremost global digital transformation services provider – leading the industry in digital and physical...

Apply Now