← Back to jobs
Hallmark Global Technologies Inc
Buffalo, NY, USA
No related jobs found
Responsibilities:
Education and Experience Required:
Combined minimum of 10 years’ higher education and/or work experience in systems design, management and/or architecture
10+ years of experience in Site Reliability Engineering, DevOps or system design and/or architecture similar roles.
5+ years of experience leading or managing observability initiatives.
Strong hands-on experience with monitoring tools like Kibana, Dynatrace, Datadog, or similar.
Solid understanding of observability concepts (metrics, logging, tracing, alerting) and frameworks (e.g., OpenTelemetry).
Experience with cloud environments such as AWS, Google Cloud, or Azure.
Familiarity with containerization (Docker, Kubernetes) and orchestration platforms.
Excellent problem-solving skills and ability to troubleshoot complex distributed systems.
Mid-level programming skills in Python, Jason, PowerShell, or other relevant languages.
Experience with incident response and post-mortem analysis.
Excellent communication and collaboration skills
Advanced analytical skills, Advanced troubleshooting skills and Advanced problem solving skills
Education and Experience Preferred:
Familiarity with infrastructure as code (Terraform, CloudFormation).
Login and enrollment instrumentation using SLO/SLI and measuring FCI and FSI.
Experience in building and maintaining distributed systems at scale.
Knowledge of security best practices in observability.
Certifications in Cloud (AWS, GCP, Azure), SRE or DevOps are a plus.
Process-oriented, Logical thinker
Strong knowledge of server/client and virtual technologies
Adaptable, Able to learn quickly in a rapid pace environment
Bachelor's degree
No related jobs found
← Back to jobs