Senior Software Engineer (DevOps)

Senior Software Engineer (DevOps)

What you'll do at

This position is part of the Walmart Labs SEM Engineering team. SEM team is in charge of optimizing paid and free search for We are tasked with optimizing ad spend on RoAS while making sure as many items from our catalogue as practical get conversion, while doing it at scale, meaning, the traffic quality should be maintained regardless of amount of ad spend. We use data science methods to dynamically modify keyword and product ad bids across top search engine providers. The team is responsible for building data pipelines and API integrations enabling us to do so, with concerns of SLA and data quality squarely in focus, as well as building internal tool to provide our business partners control and visibility of SEM operations. Our challenge is immense. Presently, the catalog is expanding by 10s of thousands of items daily and keyword universe is growing by 10s of thousands monthly.

We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile groups to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark.

As a Software Engineer (DevOps), you will have the opportunity to work closely with our data scientists, business partners and systems engineers to envision and power the tools & infrastructure for this complex ecosystem. The ideal candidate must be able to wear multiple hats (SysEng, NetEng, DBA, Dev, etc.), must be detail oriented, have superior verbal and written communication skills, strong organizational skills, able to juggle multiple tasks at once, able to work independently and can maintain professionalism under pressure. You must be able to identify problems before they happen with various monitoring and logging solutions to detect and prevent outages. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.

What you'll do:

  • Assist in designing, transitioning and deploying of applications to various public and private clouds.
  • Develop tools and framework to improve operational efficiency and anomaly detection.
  • Design and operate the environment to test application resiliency to infrastructure instabilities.
  • Responsible for application configurations in different cloud environments
  • Proactively monitor, identify, and escalate issues or root causes of systemic issues
  • Enable data scientists, business and product partners to fully leverage our platform
  • Be the consummate team player with a demonstrable ability to learn quickly.
  • Amazing ability to get stuff done.
  • Follows the industry trends in the online world

Minimum Qualifications

  • BS degree in Computer Science or a related technical field and two years experience
  • Experience in bootstrap production and non-prod environments in public/private clouds
  • Experience with CI/CD and related tools (e.g. Git, Gerrit, GitHub, Maven, Ant, Jenkins, Puppet) in Dev, QA, Staging, and Prod environments
  • Extensive experience with Linux, particularly RedHat/CentOS, SSH, DNS, etc.
  • Experience with setting up monitoring and alerting tools, e.g. Prometheus, PagerDuty, Splunk
  • Understand the metric requirements of system/application health
  • Experience writing and maintaining tools and scripts to support automation and operations, e.g. Unix Shell scripts, Ansible, etc.
  • Solid knowledge of Unix systems with the ability to troubleshoot issues in complex, distributed, multi-tier architectures.
  • Worked in SaaS/PaaS companies as DevOps engineer
  • Passionate about operational excellence and documentation
  • Good written and verbal communication skills

Preferred Qualifications

  • Experience with Kubernetes, Docker.
  • Experience with troubleshooting and performance tuning in JVM, MySQL, MongoDB, Cassandra
  • Experience with setting up a whole network infrastructure, configuring and troubleshooting networking issues
  • Experience in secure, scalable and highly available online services
  • Experience collaborating with multiple teams
  • Experience in Big Data applications
  • Accepting the "infrastructure as a code" and "immutable infrastructure" concepts

#LI- 121540801_MO1

About Walmart Labs

Imagine working in an environment where one experiment can catapult an entire industry toward a smarter future. That’s what we do at Walmart Labs. We’re a team of 5,000+ software engineers, data scientists, designers and product managers within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and our enterprise operates.

Hello, Silicon Valley

You don’t have to choose between your career and your lifestyle in Silicon Valley. Here, you can have both.

Discover Silicon Valley
Silicon Valley
View of Silicon Valley from the hills after a passing storm

All the benefits you need for you and your family

  • 100% coverage for in network preventative care
  • Retirement Plan
  • Vision Plans
  • Dental Plans
  • Exclusive Discounts

Recently viewed jobs