Senior Engineer Manager - DevOps Infrastructure - ...

Senior Engineer Manager - DevOps Infrastructure - Walmart Labs

  • Location RESTON, VA
  • Career Area Engineering
  • Job Function Engineering
  • Employment Type Full Time
  • Position Type Salary
  • Requisition 1040142BR

What you'll do at

At Walmart Labs we’re reinventing the world’s leading retail platform, leveraging our unique strengths to deliver the best customer experience wherever our customers shop. Imagine an environment where one line of code, one experiment, or one idea has the power to catapult an entire industry towards a smarter future. Better yet, imagine if that power could be yours, every day. That’s what we do at Walmart Labs.
While we are tech-empowered, we are people-led. We’re a team of 4,000+ software engineers, data scientists, designers and product managers within Walmart and across the world, delivering innovations that transform how our customers shop and how our enterprise operates. Our technologists solve some of the most complex problems at Walmart, building solutions that impact hundreds of millions of people for the world’s largest retailer.

What You'll Do:

· Assist in designing, transitioning and deploying of applications to various clouds
· Develop tools and framework to improve operational efficiency and anomaly detection
· Design and operate the environment to test application resiliency to infrastructure instabilities
· Responsible for application configurations in different cloud environments
· Proactively monitor, identify, and escalate issues or root causes of systemic issues
· Participate in the weekly rotating shift for level 1 support
· Enable data scientists, business and product partners to fully leverage our predictive analytics platform
· Be the team and technical lead with a demonstrable ability to learn quickly. Amazing ability to get stuff done.
·Follows the industry trends in the online world

Position Summary:

As a Senior Engineer Manager (SRE), you will have the opportunity to work closely with our data scientists, business partners and systems engineers to drive the definition and implementation of the next generation of applications and analytics tools for our predictive intelligence platform. The ideal candidate must be able to wear multiple hats (SysEng, NetEng, DBA, Dev, etc.), must be detail oriented, have superior verbal and written communication skills, strong organizational skills, able to juggle multiple tasks at once, able to work independently and can maintain professionalism under pressure. You must be able to identify problems before they happen with various monitoring and logging solutions to detect and prevent outages. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.

Minimum Qualifications

  • 3 years of supervisory experience.
  • Bachelor's degree in Information Technology, Computer Science, or related field and 6 years experience in information technology or related field within the past 10 years OR 8 years experience in information technology or related field within the past 10 years OR Master's degree in Information Technology, Computer Science, or related field and 4 years experience in information technology or related field within the past 10 years.

Preferred Qualifications

Experience in 24x7 operations, overseeing sites with constant high traffic
·Experience in bootstrap production and non-prod environments in
public/private clouds, Openstack experience is a plus.
·Experience with continuous integration, delivery, and related tools (e.g. Git,
Maven, Ant, Jenkins) in Dev, QA, Staging, and Prod environments
·Extensive experience with Linux, particularly RedHat/CentOS, SSH, DNS,
EMAIL, etc.
·Experience with setting up monitoring tools, e.g. Nagios, GrayLog, Logstash,
Monit, PagerDuty.
·Understand the metric requirements of system/application health
·Experience with setup, configure, and manage RDBMS, NoSQL,
Elasticsearch, Kibana, and big data severs
·Experience writing and maintaining tools and scripts to support automation
and operations, e.g. Unix Shell scripts, Ansible, etc.
·Solid knowledge of Unix systems with ability to troubleshoot issues in
complex, distributed, multi-tier architectures.
·Knowledge and experience with HAProxy, Tomcat, Node.js, etc.
·Worked in SaaS/PaaS companies as DevOps engineer
·Passionate about operational excellence and documentation
·Good written and verbal communication skills
·Experience with troubleshooting and performance tuning in JVM, MySQL,
MongoDB, Cassandra, Elasticsearch
·Experience with setting up a whole network infrastructure, configuring and
troubleshooting networking issues
·Experience in secure, scalable and highly available online services
·Experience in data visualization
·Experience collaborating with multiple teams
·Experience in Big Data applications

About Walmart

At Walmart, we help people save money so they can live better. This mission serves as the foundation for every decision we make, from responsible sourcing to sustainability—and everything in between. As a Walmart associate, you will play an integral role in shaping the future of retail, tech, merchandising, finance and hundreds of other industries—all while affecting the lives of millions of customers all over the world. Here, your work makes an impact every day. What are you waiting for?

Hello, D.C. Metro

National landmarks, museums, renowned restaurants—the D.C. Metro is a hub of activity and culture. It’s also a prime location for the future of tech.

Discover D.C. Metro
DC Metro
Aerial view of the Jefferson Memorial with downtown Washington DC in the background

All the benefits you need for you and your family

  • Multiple health plan options
  • Vision & dental plans for you & dependents
  • Associate discounts in-store and online
  • Financial benefits including 401(k), stock purchase plans and more
  • Education assistance for Associate and dependents

Recently viewed jobs