Architecture Overview
Grafana Dashboard Access
Verify Data Source
Navigate to Configuration → Data Sources to verify Prometheus connection:
- Name: Prometheus
- Type: prometheus
- URL:
http://prometheus:9090 - UID:
prometheus - Status: Should show green checkmark
Grafana is configured with anonymous access enabled (Editor role), allowing dashboard viewing without authentication.
Prometheus Metrics
Access Prometheus UI
Key Prometheus Endpoints
Key Prometheus Endpoints
- Targets: http://localhost:9090/targets
- Configuration: http://localhost:9090/config
- Service Discovery: http://localhost:9090/service-discovery
- Alerts: http://localhost:9090/alerts
Prometheus Configuration
The Prometheus server scrapes metrics from gNMIc:Verify Metric Collection
Query Metrics in Prometheus
Navigate to http://localhost:9090/graph and try:
Check Target Health
Visit http://localhost:9090/targets and verify:
- Target:
gnmic:9273 - State: UP
- Last Scrape: < 5s ago
gNMIc Telemetry Collector
Configuration Overview
The gNMIc collector automatically discovers and subscribes to Nokia devices using Docker labels.Device Discovery Configuration
Device Discovery Configuration
Active Subscriptions
SR Linux Subscriptions
SR Linux Subscriptions
Sample Interval: 5 seconds
srl_platform: CPU and memory usagesrl_apps: Application managementsrl_if_stats: Interface statistics and operational statesrl_if_lag_stats: LAG member statisticssrl_net_instance: Network instance state and route tablessrl_bgp_stats: BGP protocol statisticssrl_event_handler_stats: Event handler metrics
SR OS Subscriptions (BNG/Switch/OLT)
SR OS Subscriptions (BNG/Switch/OLT)
Sample Interval: 5 seconds (10s for VPLS SAPs)
sros_ports_stats: Port operational state and statisticssros_router_bgp: BGP statistics and routes per familysros_router_interface: IPv4/IPv6 interface statisticssros_router_isis: IS-IS protocol statisticssros_router_route_table: Route table statisticssros_system: CPU and memory pool usagesros_service_stats: VPLS/VPRN service operational statesros_ludb: Local user database (subscriber info)sros_vpls_sap_all: VPLS SAP statisticssros_temperature_stats: Hardware temperature sensorssros_fan_stats: Chassis fan speeds
View gNMIc Logs
Key Metrics to Monitor
System Health Metrics
CPU Usage
CPU Usage
Memory Usage
Memory Usage
Interface Status
Interface Status
Critical interfaces should be monitored with alerts for state changes.
BNG-Specific Metrics
Subscriber Sessions
Subscriber Sessions
VPLS Service Status
VPLS Service Status
Network Performance Metrics
Interface Traffic
Interface Traffic
Interface Errors
Interface Errors
BGP Sessions
BGP Sessions
RADIUS Accounting Logs
Access RADIUS Logs
RADIUS Log Locations
RADIUS Log Locations
- Main Log:
/var/log/radius/radius.log - Accounting:
/var/log/radius/radacct/ - Configuration:
/etc/raddb/
Device Health Monitoring
Temperature Monitoring
Fan Speed Monitoring
Fan speed should remain consistent. Sudden drops may indicate hardware issues.
Container Logs and Monitoring
View Container Logs
Network Devices
Network Devices
Telemetry Stack
Telemetry Stack
Services
Services
Container Resource Usage
Alerting Best Practices
Troubleshooting Monitoring Issues
No Metrics in Prometheus
No Metrics in Prometheus
- Check gNMIc is running:
docker ps | grep gnmic - Verify gNMIc metrics endpoint:
curl http://localhost:9273/metrics - Check Prometheus targets: http://localhost:9090/targets
- Review gNMIc logs:
docker logs clab-lab-gnmic
Grafana Shows No Data
Grafana Shows No Data
- Verify Prometheus data source connection in Grafana
- Check time range in dashboard (default: last 6 hours)
- Run test query in Prometheus UI first
- Ensure dashboards are using correct metric names
gNMIc Connection Failures
gNMIc Connection Failures
- Verify device gRPC port is accessible:
netstat -tuln | grep 57400 - Check credentials:
admin/lab123 - Confirm Docker socket is mounted:
docker exec clab-lab-gnmic ls -l /var/run/docker.sock - Review device labels:
docker inspect clab-lab-bng1 | grep clab-node-kind