Charlotte, NC, USA
Key Responsibilities
· Splunk Integration & Configuration: Define and implement strategies for ingesting and processing application-specific telemetry (logs, metrics, and traces) into the existing Splunk environment.
· Observability Architecture: Design and maintain robust monitoring frameworks that provide actionable insights into application health, performance bottlenecks, and user experience.
· Custom Parameter Configuration: Map complex application-level parameters to Splunk dashboards, ensuring developers and SREs have granular visibility into microservices and application workflows.
· Reliability Engineering: Drive the SRE culture by automating incident response, reducing Mean Time to Detection (MTTD), and improving Mean Time to Recovery (MTTR) through proactive observability.
· Cross-Functional Collaboration: Partner with the CIS team to ensure that application-side configurations align with enterprise standards for performance, security, and data retention.
· Performance Optimization: Analyze existing Splunk queries and data ingestion pipelines to optimize for speed, cost, and relevance in an application-monitoring context.
Technical Requirements
· Splunk Proficiency: Deep expertise in Splunk, including the Search Processing Language (SPL), data modeling, dashboard creation, and configuration of Splunk App for Infrastructure/APM.
· Application Monitoring: Strong understanding of distributed systems, microservices architectures, and how to instrument applications for observability (e.g., OpenTelemetry, agents, SDKs).
· Cloud & Infrastructure: Familiarity with cloud-native technologies (AWS/Azure/GCP) and container orchestration (Kubernetes/Docker) is highly preferred.
· Automation: Scripting experience (Python, Bash, or Go) to automate configuration tasks and data pipelines.
· SRE Principles: Proven experience implementing Error Budgets, Service Level Objectives (SLOs), and Service Level Indicators (SLIs).
Bachelor's degree