Expert Systems Engineer, Devops

Expert Systems Engineer, Devops

What you'll do at

This position is part of the Walmart Labs Advertising Technology team. This is a high-impact, fast-moving team working on greenfield projects with the backing of a Fortune 1 company. Our mission is to make digital advertisements more effective through low-latency serving, precise targeting, cutting-edge measurement and optimization technologies leveraging the trove of big data within the Walmart ecosystem. We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile groups to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark. The team also operates an end to end advertising platform that includes a scalable ad service that serves hundreds of millions of impressions each day, sophisticated ad matching algorithms, real-time reports, self-service interface for end to end program management etc.

As a Senior Software Engineer (DevOps), you will have the opportunity to work closely with our data scientists, business partners and systems engineers to envision and power the tools & infrastructure for this complex advertising ecosystem. The ideal candidate must be able to wear multiple hats (SysEng, NetEng, DBA, Dev, etc.), must be detail oriented, have superior verbal and written communication skills, strong organizational skills, able to juggle multiple tasks at once, able to work independently and can maintain professionalism under pressure. You must be able to identify problems before they happen with various monitoring and logging solutions to detect and prevent outages. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.


What you'll do:



  • Assist in designing, transitioning and deploying of applications to various clouds

  • Develop tools and framework to improve operational efficiency and anomaly detection

  • Design and operate the environment to test application resiliency to infrastructure instabilities

  • Responsible for application configurations in different cloud environments

  • Proactively monitor, identify, and escalate issues or root causes of systemic issues

  • Enable data scientists, business and product partners to fully leverage our platform

  • Be the consummate team player with a demonstrable ability to learn quickly. Amazing ability to get stuff done.

  • Follows the industry trends in the online world

Minimum Qualifications


  • BS degree in Computer Science or a related technical field and five years experience

  • Experience in 24x7 operations, overseeing sites with constant high traffic

  • Experience in bootstrap production and non-prod environments in public/private clouds

  • Experience with CI/CD and related tools (e.g. Git, Gerrit, GitHub, Maven, Ant, Jenkins) in Dev, QA, Staging, and Prod environments

  • Extensive experience with Linux, particularly RedHat/CentOS, SSH, DNS, etc.

  • Experience with setting up monitoring and alerting tools, e.g. Prometheus, PagerDuty, Splunk

  • Understand the metric requirements of system/application health

  • Experience with setting up, configuring, and managing RDBMS and data stores such as Solr, Casandra, etc.

  • Experience writing and maintaining tools and scripts to support automation and operations, e.g. Unix Shell scripts, Ansible, etc.

  • Solid knowledge of Unix systems with the ability to troubleshoot issues in complex, distributed, multi-tier architectures.

  • Worked in SaaS/PaaS companies as DevOps engineer

  • Passionate about operational excellence and documentation

  • Good written and verbal communication skills

Preferred Qualifications


  • Experience with troubleshooting and performance tuning in JVM, MySQL, MongoDB, Cassandra

  • Experience with setting up a whole network infrastructure, configuring and troubleshooting networking issues

  • Experience in secure, scalable and highly available online services

  • Experience collaborating with multiple teams

  • Experience in Big Data applications

  • Accepting the "infrastructure as a code" and "immutable infrastructure" concepts

About Walmart Labs

Imagine working in an environment where one experiment can catapult an entire industry toward a smarter future. That’s what we do at Walmart Labs. We’re a team of 5,000+ software engineers, data scientists, designers and product managers within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and our enterprise operates.

Hello, Silicon Valley

You don’t have to choose between your career and your lifestyle in Silicon Valley. Here, you can have both.

Discover Silicon Valley
Silicon Valley
View of Silicon Valley from the hills after a passing storm

All the benefits you need for you and your family

  • 100% coverage for in network preventative care
  • Retirement Plan
  • Vision Plans
  • Dental Plans
  • Exclusive Discounts

Recently viewed jobs