Server Monitoring Best Practices: Keeping Your WooCommerce App and MongoDB Clusters Alive on Google Cloud

Proactive MongoDB Health Checks with Google Cloud Monitoring & Prometheus

Maintaining the health and performance of MongoDB clusters, especially those powering critical applications like WooCommerce, demands a robust monitoring strategy. On Google Cloud Platform (GCP), this involves leveraging native tools and integrating with open-source solutions like Prometheus for deeper insights. We’ll focus on key metrics that indicate potential issues before they impact your users.

A fundamental aspect of MongoDB monitoring is tracking connection counts, query performance, and resource utilization. For this, we’ll set up Prometheus to scrape metrics exposed by MongoDB via the `mongodb_exporter`. This exporter can be deployed as a sidecar container within your Kubernetes pods or as a standalone service.

Deploying and Configuring `mongodb_exporter`

First, ensure you have Prometheus and Grafana deployed on your GCP infrastructure, likely within a GKE cluster. The `mongodb_exporter` can be deployed using Helm or directly as a Kubernetes Deployment. Here’s a sample Kubernetes Deployment manifest:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: mongodb-exporter
  namespace: monitoring
spec:
  replicas: 1
  selector:
    matchLabels:
      app: mongodb-exporter
  template:
    metadata:
      labels:
        app: mongodb-exporter
    spec:
      containers:
      - name: mongodb-exporter
        image: percona/mongodb_exporter:latest
        ports:
        - containerPort: 9274
        env:
        - name: MONGODB_EXPORTER_HOST
          value: "your-mongodb-replica-set-or-standalone-host:27017" # e.g., mongo-0.mongo-headless.default.svc.cluster.local
        - name: MONGODB_EXPORTER_USERNAME
          valueFrom:
            secretKeyRef:
              name: mongodb-exporter-secrets
              key: username
        - name: MONGODB_EXPORTER_PASSWORD
          valueFrom:
            secretKeyRef:
              name: mongodb-exporter-secrets
              key: password
        - name: MONGODB_EXPORTER_COLLECTION
          value: "admin" # Or your specific database for authentication
        # Optional: For TLS/SSL
        # - name: MONGODB_EXPORTER_TLS
        #   value: "true"
        # - name: MONGODB_EXPORTER_TLS_CA_FILE
        #   value: "/etc/ssl/certs/ca-certificates.crt" # Or path to your CA cert
        # - name: MONGODB_EXPORTER_TLS_CERT_FILE
        #   value: "/etc/ssl/certs/mongodb.crt"
        # - name: MONGODB_EXPORTER_TLS_KEY_FILE
        #   value: "/etc/ssl/certs/mongodb.key"

You’ll need to create a Kubernetes Secret named `mongodb-exporter-secrets` containing the `username` and `password` for the MongoDB user that `mongodb_exporter` will use. This user should have at least the `clusterMonitor` role.

Prometheus Configuration for Scraping

Next, configure your Prometheus instance to scrape the metrics from the `mongodb_exporter`. If you’re using the Prometheus Operator, you can define a `ServiceMonitor` resource:

apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: mongodb-exporter
  namespace: monitoring # Ensure this matches your Prometheus namespace
  labels:
    release: prometheus # Or your Prometheus release name
spec:
  selector:
    matchLabels:
      app: mongodb-exporter
  namespaceSelector:
    matchNames:
    - monitoring # Namespace where mongodb-exporter is deployed
  endpoints:
  - port: web
    interval: 30s
    path: /metrics

This `ServiceMonitor` tells the Prometheus Operator to discover the `mongodb-exporter` service (based on labels) and configure Prometheus to scrape its `/metrics` endpoint on port `9274` every 30 seconds.

Key MongoDB Metrics to Monitor

Once Prometheus is scraping, focus on these critical metrics in Grafana dashboards:

mongodb_connections_current: The number of currently open connections. High values can indicate connection leaks or insufficient connection pooling.
mongodb_commands_total: Total number of commands executed. Monitor specific command rates (e.g., `insert`, `query`, `update`, `delete`) to identify performance bottlenecks.
mongodb_network_bytes_in_total and mongodb_network_bytes_out_total: Network traffic. Spikes can indicate heavy read/write operations or inefficient queries.
mongodb_opcounters_total: Operation counters for inserts, queries, updates, deletes, etc. Track rates of change to understand workload.
mongodb_replication_lag_seconds: For replica sets, this is crucial. High lag indicates that secondary nodes are not keeping up with primary writes, potentially leading to stale reads or data loss during failover.
mongodb_storage_data_size_bytes and mongodb_storage_free_storage_bytes: Disk usage. Monitor for approaching capacity limits.
mongodb_global_lock_percent: Percentage of time the global lock is held. High values indicate contention and can severely impact write performance.
mongodb_page_faults_total: Number of page faults. High rates suggest insufficient RAM, leading to excessive disk I/O.

Set up alerts in Prometheus Alertmanager for critical thresholds on these metrics. For example, an alert for `mongodb_replication_lag_seconds` exceeding 60 seconds on any secondary node is a common and important alert.

WooCommerce Application Performance Monitoring (APM) with Google Cloud Trace & Logging

WooCommerce, being a PHP application, requires monitoring at the application layer to identify slow database queries, inefficient PHP code, and external API call latency. Google Cloud’s integrated APM tools, Cloud Trace and Cloud Logging, are invaluable here.

Integrating Cloud Trace with PHP

To enable distributed tracing for your WooCommerce application, you’ll need to instrument your PHP code. The most common way is by using the OpenTelemetry PHP SDK or a vendor-specific agent. For GCP, the OpenTelemetry Collector can forward traces to Cloud Trace.

First, install the OpenTelemetry PHP extension and the necessary libraries. This is typically done via Composer:

composer require open-telemetry/sdk
composer require open-telemetry/exporter-otlp
composer require open-telemetry/auto-instrumentation-http
composer require open-telemetry/auto-instrumentation-psr18
composer require open-telemetry/auto-instrumentation-mongodb # If using the official MongoDB PHP driver

Next, configure the OpenTelemetry SDK to export traces to the OpenTelemetry Collector, which will then forward them to Cloud Trace. This often involves setting environment variables or a configuration file.

A basic PHP bootstrap script might look like this:

<?php
require 'vendor/autoload.php';

use OpenTelemetry\API\Trace\TracerProviderInterface;
use OpenTelemetry\SDK\Trace\TracerProvider;
use OpenTelemetry\SDK\Trace\SpanProcessor\BatchSpanProcessor;
use OpenTelemetry\SDK\Trace\SpanExporter\OtlpExporter;
use OpenTelemetry\SDK\Resource\ResourceInfo;
use OpenTelemetry\Context\Propagation\TraceContextPropagator;
use OpenTelemetry\Extension\AutoInstrumentation\Http\Psr18\HttpClientAutoInstrumentation;
use OpenTelemetry\Extension\AutoInstrumentation\Psr18\Psr18ClientHttpFactory;
use OpenTelemetry\Extension\AutoInstrumentation\MongoDb\MongoDbInstrumentation;

// Configure the exporter to send traces to the OTLP collector (e.g., running locally or on a sidecar)
$exporter = new OtlpExporter('http://localhost:4318'); // Adjust collector endpoint as needed

// Create a tracer provider
$tracerProvider = new TracerProvider(
    new BatchSpanProcessor($exporter),
    ResourceInfo::create(attributes: [
        'service.name' => 'woocommerce-app',
        'deployment.environment' => getenv('ENVIRONMENT') ?: 'production',
    ])
);

// Register the tracer provider globally
\OpenTelemetry\API\GlobalTracerProvider::set($tracerProvider);

// Register propagators for context propagation
\OpenTelemetry\API\OpenTelemetry::setPropagator(new TraceContextPropagator());

// Enable auto-instrumentation for HTTP clients and MongoDB driver
// Ensure your HTTP client factory (e.g., Guzzle) is compatible with Psr18ClientHttpFactory
$httpClientFactory = new Psr18ClientHttpFactory();
HttpClientAutoInstrumentation::register($httpClientFactory);
MongoDbInstrumentation::register(); // For the official mongodb driver

// Your WooCommerce application bootstrap code follows...
// e.g., $app = require __DIR__ . '/bootstrap/app.php';
// $app->run();

// Ensure spans are flushed on shutdown
register_shutdown_function(function () use ($tracerProvider) {
    $tracerProvider->shutdown();
});
?>

You’ll need an OpenTelemetry Collector running, configured to receive OTLP gRPC/HTTP and export to Google Cloud Trace. A minimal collector configuration might look like this:

receivers:
  otlp:
    protocols:
      grpc:
      http:

exporters:
  googlecloudtrace:
    project: "your-gcp-project-id" # Replace with your GCP Project ID
    # Optional: If running outside GCP, you might need to configure authentication
    # credentials: "/path/to/your/service-account-key.json"

service:
  pipelines:
    traces:
      receivers: [otlp]
      exporters: [googlecloudtrace]

Deploy this collector as a Kubernetes Deployment or DaemonSet within your GKE cluster, ensuring its `googlecloudtrace` exporter is configured with your GCP project ID and appropriate authentication if necessary.

Leveraging Cloud Logging for WooCommerce Errors and Slow Queries

Cloud Logging is essential for capturing application logs, including PHP errors, warnings, and custom log messages. For WooCommerce, ensure your `php.ini` is configured to log errors and that these logs are being sent to Cloud Logging.

; php.ini settings
error_reporting = E_ALL
display_errors = Off
log_errors = On
error_log = /var/log/php/error.log ; Ensure this path is writable by your web server/PHP-FPM

If running on GKE, the Cloud Logging agent (often Fluentd or Fluent Bit) deployed by default will pick up logs from standard output/error or specified log files. You can configure custom log parsing and routing using the agent’s configuration.

To specifically capture slow MongoDB queries, you can enable the MongoDB slow query log. Configure MongoDB to log queries exceeding a certain threshold (e.g., 100ms):

# mongod.conf
systemLog:
  destination: file
  path: /var/log/mongodb/mongod.log
  logAppend: true
  verbosity: 0 # Default verbosity
  quiet: false
  timeStampFormat: iso8601-utc

# Enable slow query logging
operationProfiling:
  slowOpThresholdMs: 100 # Log operations slower than 100ms
  mode: "slowOp" # Or "all" for full profiling

Ensure the log file path is accessible by your logging agent. You can then create log sinks in GCP to route these slow query logs to BigQuery for analysis or to Pub/Sub for real-time alerting.

GCP Compute Engine & Load Balancer Monitoring

For WooCommerce instances running on Compute Engine VMs or behind Google Cloud Load Balancers, GCP’s native monitoring provides essential infrastructure-level metrics.

Compute Engine VM Metrics

Google Cloud Monitoring automatically collects metrics from Compute Engine instances. Key metrics to watch include:

compute.googleapis.com/instance/cpu/utilization: CPU usage.
compute.googleapis.com/instance/memory/usage: Memory usage (requires the Ops Agent or legacy Stackdriver agent).
compute.googleapis.com/instance/disk/read_bytes_count and write_bytes_count: Disk I/O.
compute.googleapis.com/instance/network/sent_bytes_count and received_bytes_count: Network traffic.

Set up custom metrics dashboards in GCP’s Cloud Monitoring console for your WooCommerce VMs. Create alerting policies for CPU utilization exceeding 80% for sustained periods, low disk space, or unusual network traffic patterns.

Load Balancer Monitoring

Google Cloud Load Balancers (HTTP(S), TCP, UDP) offer critical insights into traffic distribution and health.

loadbalancing.googleapis.com/https/request_count: Total requests processed.
loadbalancing.googleapis.com/https/backend_latencies: Latency from the load balancer to the backend instances.
loadbalancing.googleapis.com/https/backend_response_codes_count: Distribution of HTTP response codes (2xx, 3xx, 4xx, 5xx). A surge in 5xx errors is a critical indicator of backend issues.
loadbalancing.googleapis.com/https/backend_health_check_status: Health status of backend instances.

Configure alerting policies for:

A significant increase in 5xx error codes.
A high percentage of unhealthy backend instances.
Unusual spikes in request latency.

For advanced analysis, export load balancer logs to BigQuery. This allows for detailed query analysis of traffic patterns, error sources, and user behavior.

Automated Alerting and Incident Response

A robust monitoring system is only effective if it triggers timely alerts and facilitates rapid incident response. Integrate your monitoring tools with communication platforms like Slack or PagerDuty.

For Prometheus, configure Alertmanager to route alerts based on severity and type. For GCP Cloud Monitoring, create alerting policies that specify notification channels.

Consider implementing automated remediation actions for common issues. For example, if CPU utilization on a WooCommerce VM consistently exceeds a threshold, an automated script could trigger a VM resize or restart. This requires careful design and testing to avoid unintended consequences.

Regularly review your monitoring dashboards and alert configurations. As your WooCommerce application scales and evolves, your monitoring strategy must adapt to ensure continuous availability and performance.