UST Global Inc
Job Summary
We are seeking a skilled Grafana Administrator to lead the implementation, configuration, and ongoing management of Grafana for real-time monitoring and visualization. This role demands expertise in integrating Grafana with diverse data sources, developing custom dashboards, managing user access, and ensuring optimal system performance.
Key Responsibilities
Grafana Setup and Configuration
Install, configure, and upgrade Grafana on on-premises or cloud-based infrastructure.
Integrate Grafana with data sources such as Prometheus, Elasticsearch, MySQL, PostgreSQL, and other supported tools.
Dashboard Development
Design and build custom dashboards to deliver insights into system and application performance.
Create intuitive and functional visualizations suitable for both technical and non-technical stakeholders.
ing and Notification
Configure ing mechanisms and notification channels (e.g., Microsoft Teams, PagerDuty, email).
Monitor system and application health using advanced metrics and thresholds.
Access Management
Manage user roles and permissions for secure and efficient access to Grafana.
Implement Single Sign-On (SSO) and LDAP integration as required.
Performance Optimization
Monitor and optimize Grafana performance for scalability and speed.
Diagnose and resolve performance or configuration issues.
Documentation and Training
Develop documentation for Grafana configurations, dashboards, and best practices.
Conduct training sessions to empower team members to use Grafana effectively.
Collaboration
Collaborate with DevOps, SRE, and development teams to understand and address monitoring requirements.
Align Grafana implementations with business and operational goals.
Security and Compliance
Ensure Grafana adheres to security policies and industry best practices.
Regularly review and address vulnerabilities in Grafana installations.
Required Skills and Qualifications
Proficiency in Grafana implementation, administration, and customization.
Experience integrating Grafana with tools like Prometheus, Graphite, InfluxDB, or Elasticsearch.
Strong understanding of time-series data and metrics visualization.
Familiarity with monitoring tools (e.g., Prometheus) and cloud platforms (AWS, Azure).
Expertise in Linux/Unix system administration.
Hands-on experience with scripting languages (e.g., Python, Bash) for automation.
Knowledge of DevOps tools such as Kubernetes, Docker, and CI/CD pipelines.
Excellent problem-solving and troubleshooting skills.
Strong communication and teamwork capabilities.
Preferred Qualifications
Experience with Grafana plugins and API usage.
Knowledge of Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible).
Certification in Grafana or other monitoring/visualization tools.