As a Senior Software Engineer (Reliability Engineering), you are responsible for working with Walmart’s state-of-the-art ecommerce fulfillment center warehouse management system (WMS) as part of the Supply Chain Technology organization. The initiatives require ensuring smooth functioning of the WMS and creating a great customer order fulfillment experience. We are looking to bring in more intellectually curious engineers who are passionate about technology and about creating engineering solutions to operations problems, such as optimizing existing systems, building monitoring infrastructure, and eliminating work through automation, and who find innovative ways to reduce time spent on manual operations and proactively identify potential downtime.

Responsibilities include:

• Demonstrate up-to-date expertise in Software Engineering and apply this to the development, execution, and improvement of action plans.
• Understanding of incident management processes and procedures.
• Technical understanding of core infrastructure, cloud services, platforms, and microservices.
• Ability to understand and capture key data from logs.
• Ability to triage effectively: detect issues and distinguish symptom from cause.
• Analyze trends to proactively prevent incidents.
• Focus on immediate restoration over root cause.
• Create and maintain procedural documentation.
• Identify and drive continuous improvement efforts to reduce waste (eliminate, automate or streamline).
• Build tools to improve visibility, proactively detect issues, and restore system availability.
• Strong focus on collecting and inferring metrics.
• Analyze systems and make recommendations to prevent possible problems.
• Take the lead on issue resolution activities using knowledge of complex and company-wide systems.
• Perform build, deployment, and continuous integration processes to move code and configurations from local development environments to QA and Production environments.
• Work as a Level 2 production support engineer on a rotation basis to help the Level 1 production support team with any production issue where engineering help is required.
• Treat production environment health as the first priority, enabling automated monitoring and alerting to meet SLAs.
• Clear communication skills.

Minimum Qualifications

• Bachelor's Degree in Computer Science or related field and 4-6 years of experience, or Master's Degree in Computer Science or related field and 3-5 years of experience
• Proven industry experience with large-scale distributed systems
• Solid experience with object-oriented and/or event-driven systems
• Strong Java programming experience
• Scripting skills in at least one of the following: Shell, Perl, Python, Bash, or Ruby
• Extensive experience troubleshooting production systems
• Building tools for monitoring and alerting

Preferred Qualifications

• Knowledge of SQL/NoSQL and database technologies (Oracle, Cassandra, Hive)
• Strong aptitude for writing efficient code
• Attitude to thrive in a fun, fast-paced, startup-like environment
• Experience in production system operations (logging, telemetry, alerting, etc.)
• Excellent communication and problem-solving skills
• Has ambition and vigor to add value to a rapidly growing engineering team

About Walmart Labs

Imagine working in an environment where one experiment can catapult an entire industry toward a smarter future. That’s what we do at Walmart Labs. We’re a team of 4,000+ software engineers, data scientists, designers and product managers within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and our enterprise operates.

