job summary: This is an amazing opportunity with PWC. PricewaterhouseCoopers is a multinational professional services network of firms, operating as partnerships under the PwC brand. PwC ranks as the second-largest professional services network in the world and is considered one of the Big Four accounting firms. We are looking for some to be part of a site reliability engineering team- team of 22, continuing level III support for PwC digital products. To work with On-shore/off-shore resources and who is multi-disciplined individual wirh experience in - cloud computing (Azure), CI/CD, infrastructure as code, sys admin, network monitoring, etc- as many as possible/combo. To be able to managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market taking a holistic view of system health into account. location: Tampa, Florida job type: Contract salary: $80 - 95 per hour work hours: 8am to 5pm education: Bachelors responsibilities: Responsibilities: Demonstrates extensive abilities and/or a proven record of success in the following areas:Providing SRE support for multiple distributed software applications (client-facing - internal & external);Managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market taking a holistic view of system health into account;Gathering and analyzing metrics from both systems and applications for performance tuning and fault finding;Partnering with development teams to improve services through rigorous testing and release procedures meeting security, compliance & performance requirements;Participating in systems design, platform management, and capacity planning. Ensure that platforms are designed with "operability " in mind;Pursuing the discovery of system faults throughout the application lifecycle - before & after release;Defining, Implementing and being accountable for Velocity & Reliability (SLIs, SLOs, Error Budgets);Creating & supporting sustainable systems and services through automation (to drive the problems away not just mere automation) and uplifts for infrastructure, testing, failover solutions, failure mitigation, etc.;Writing, updating, and using documentation, including runbooks/playbooks; and,Using Chaos Engineering to test the robustness of the systems and applications. Qualifications 5+ years professional experience with various flavors of Linux and/or Windows5+ years experience in supporting and troubleshooting full stack applications (monolithic and microservices), infrastructure and legacy applications (root cause analysis through identifying, analyzing and remediating service(s) performance and availability issues to ensure maximum service uptime and availability)5+ years experience with cloud computing technology and its concepts (Azure, AWS, GCP)3+ years experience in balancing service reliability, metrics, sustainability, technical debt, and operational toil for live services running at scale3+ years experience with container technologies and orchestration (Docker, Kubernetes-AKS, EKS, GKE)3+ year implementing DevOps practices at scale Demonstrates extensive abilities and/or a proven record of success in the following areas: Experience in one or more of the following: Go, Python, Ruby, Java, Perl, Shell, or Powershell;Experience with CI/CD tool chain- Git, Jenkins, Azure DevOps. Veracode, SonarQube, JFrog Artifactory;Experience with IaC with Terraform, ARM templates, and/or AWS CloudFormation templates;Experience with configuration management tools like Ansible, Puppet and/or Chef;Experience with DBaaS/Managed Cloud database technologies such as CosmosDB, DynamoDB, Managed SQL (RDS, SQL Database), In-memory (Cache for Redis, ElastiCache);Experience with application performance monitoring tools (AppDynamics, Azure application insights, Dynatrace, or Datadog) and log management tools (Azure Monitor's log analytics, Elastic Stack, and/or Splunk) defining, creating and configuring metrics for dashboards and alerts;Experience with distributed storage technologies like Azure (Blob, Files, Tables), S3, NFS, HDFS;Experience with Web server technologies- HTTP, Nginx, Apache, Tomcat;Experience in Kafka, Azure Event hubs or similar message queue technologies;Experience with Service mesh platforms such as Istio, Hashicorp Consul;Experience with Secrets Lifecycle management (Azure Keyvault, Hashicorp Vault);Experience on minimal or near zero downtime deployments as Blue-Green, Canary, rolling upgrades, etc.;Define and implement HA, DR and rollback strategies along with the product and build teams;Possess proficiency in Networking concepts (HTTP/S, TCP/IP, DNS, Virtual Networks (VNet, VPC), Subnets, Routing, Firewalls, and Network Security, triaging packet loss etc) and knowledge on RESTful APIs;Experience with 24x7x365 monitoring, incident response and oncall support;Experience in troubleshooting that spans systems, network, and code;Experience determining & negotiating Error budgets, SLIs, SLOs, and SLAs with product owners;Demonstrate systematic problem-solving approach, coupled with solid communication skills;Demonstrate the ability to work independently and as a member of a greater team, including cross-team activities; and,Experience working in Agile Scrum, Kanban methodologies in SDLC. Preferred Qualifications: Demonstrates extensive abilities and/or a proven record of success in the following areas:Demonstrating experience within development of the complete application stack inclusive of software engineering and systems engineering responsibilities (e.g. full-stack development); Requirement gathering, validation, fulfillment and change management Infrastructure operations experience including self-healing autonomy; working within regulatory frameworks such as SOX, SOC2, etc.; Experience in Chaos engineering; Experience with integration technologies like SnapLogic; Experience with a variety of databases and basic DBA skills (MySQL, SQL Server, Oracle, Postgres, Redis, Couchbase and/or Cassandra). qualifications: Experience level: ExperiencedMinimum 5 years of experienceEducation: Bachelors skills: DevOpsAzure (5 years of experience is required)Linux System Engineer (5 years of experience is required)monolithic (5 years of experience is required)microservices (5 years of experience is required)cloud computing technology (5 years of experience is required)Kubernetes (3 years of experience is required)Dockers (3 years of experience is required)service reliability (3 years of experience is preferred)Infrastructure (5 years of experience is required) Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status. At Randstad, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact HRsupport@randstadusa.com. For certain assignments, Covid-19 vaccination and/or testing may be required by Randstad's client or applicable federal mandate, subject to approved medical or religious accommodations. Carefully review the job posting for details on vaccine/testing requirements or ask your Randstad representative for more information