How to implement native Redis caching layers for high-volume custom taxonomy queries in Timber Twig templating engines

Leveraging Redis for Custom Taxonomy Query Optimization in Timber/Twig

High-volume WordPress sites often encounter performance bottlenecks when querying custom taxonomies, especially within complex Timber/Twig templates. Repeatedly fetching and processing the same taxonomy data can strain database resources and slow down page load times. This document outlines a robust strategy for implementing a native Redis caching layer to dramatically improve the performance of these queries.

Identifying the Bottleneck: Taxonomy Query Patterns

Consider a scenario where a theme displays a list of related posts based on shared custom taxonomy terms. A common implementation might look like this within a Timber context:

// In a Timber context (e.g., functions.php or a custom plugin)

function get_related_posts_by_taxonomy( $post_id, $taxonomy_slug, $limit = 5 ) {
    $terms = wp_get_post_terms( $post_id, $taxonomy_slug, array( 'fields' => 'ids' ) );

    if ( is_wp_error( $terms ) || empty( $terms ) ) {
        return array();
    }

    $args = array(
        'post_type'      => 'post', // Or your custom post type
        'posts_per_page' => $limit,
        'post_status'    => 'publish',
        'tax_query'      => array(
            array(
                'taxonomy' => $taxonomy_slug,
                'field'    => 'term_id',
                'terms'    => $terms,
            ),
        ),
        'post__not_in'   => array( $post_id ), // Exclude the current post
    );

    $query = new WP_Query( $args );

    if ( $query->have_posts() ) {
        return $query->posts;
    }

    return array();
}

// In a Twig template:
// {% set related_posts = Timber\Timber::get_posts( function('get_related_posts_by_taxonomy', _context.post.ID, 'genre') ) %}
// {% for post in related_posts %}
//     <li><a href="{{ post.link }}">{{ post.title }}</a></li>
// {% endfor %}

On a busy product listing page or an article with many tags, `wp_get_post_terms` and `WP_Query` can be executed hundreds or thousands of times per page load. If the taxonomy structure or the terms assigned to posts don’t change frequently, this is a prime candidate for caching.

Implementing a Redis Cache Layer

We’ll create a wrapper function that intercepts taxonomy queries, checks Redis for a cached result, and stores the result in Redis if it’s not found. This requires a robust Redis client for PHP. The popular `predis/predis` library is an excellent choice.

First, ensure you have `predis/predis` installed via Composer:

composer require predis/predis

Redis Client Initialization

It’s crucial to initialize the Redis client once and reuse the connection. A common pattern is to use a singleton or a static property within a dedicated class.

// In a dedicated file, e.g., includes/redis-cache.php, and include it in functions.php

class RedisCache {
    private static $client = null;
    private static $connected = false;

    /**
     * Get the Redis client instance.
     *
     * @return \Predis\Client|null
     */
    public static function get_client() {
        if ( ! self::$client ) {
            try {
                // Configure your Redis connection details
                $redis_config = array(
                    'scheme' => 'tcp',
                    'host'   => defined('REDIS_HOST') ? REDIS_HOST : '127.0.0.1',
                    'port'   => defined('REDIS_PORT') ? REDIS_PORT : 6379,
                    // 'password' => defined('REDIS_PASSWORD') ? REDIS_PASSWORD : null,
                    // 'database' => defined('REDIS_DB') ? REDIS_DB : 0,
                );

                self::$client = new \Predis\Client( $redis_config );
                self::$client->connect(); // Explicitly connect
                self::$connected = true;
            } catch ( \Predis\Connection\ConnectionException $e ) {
                // Log the error or handle it gracefully
                error_log( "Redis connection failed: " . $e->getMessage() );
                self::$connected = false;
                self::$client = null; // Ensure client is null on failure
            }
        }
        return self::$client;
    }

    /**
     * Check if Redis is available.
     *
     * @return bool
     */
    public static function is_available() {
        // Attempt to get client, which will try to connect if not already.
        // If connection fails, get_client() returns null and sets self::$connected to false.
        self::get_client();
        return self::$connected;
    }

    /**
     * Generate a cache key based on query parameters.
     *
     * @param string $base_key A base identifier for the cache.
     * @param array  $params   Parameters to include in the key.
     * @return string
     */
    public static function generate_key( $base_key, array $params = array() ) {
        ksort( $params ); // Ensure consistent key order
        return 'wp_cache:' . $base_key . ':' . md5( json_encode( $params ) );
    }
}

Define constants for your Redis connection in wp-config.php for better manageability:

// In wp-config.php
define( 'REDIS_HOST', '127.0.0.1' );
define( 'REDIS_PORT', 6379 );
// define( 'REDIS_PASSWORD', 'your_redis_password' );
// define( 'REDIS_DB', 1 ); // Use a different DB for WordPress caching

Caching the Taxonomy Query Function

Now, let’s refactor the `get_related_posts_by_taxonomy` function to incorporate Redis caching. We’ll add parameters for cache duration and a unique cache key identifier.

/**
 * Retrieves related posts based on shared taxonomy terms, with Redis caching.
 *
 * @param int    $post_id       The ID of the current post.
 * @param string $taxonomy_slug The slug of the taxonomy to query.
 * @param int    $limit         The maximum number of related posts to retrieve.
 * @param int    $cache_duration The duration in seconds to cache the results.
 * @return array An array of WP_Post objects.
 */
function get_related_posts_by_taxonomy_cached( $post_id, $taxonomy_slug, $limit = 5, $cache_duration = 3600 ) {
    // Check if Redis is available and configured
    if ( ! RedisCache::is_available() ) {
        // Fallback to non-cached function if Redis is down
        return get_related_posts_by_taxonomy( $post_id, $taxonomy_slug, $limit );
    }

    // Generate a unique cache key
    $cache_key_params = array(
        'post_id'       => $post_id,
        'taxonomy_slug' => $taxonomy_slug,
        'limit'         => $limit,
    );
    $cache_key = RedisCache::generate_key( 'related_posts', $cache_key_params );

    $redis_client = RedisCache::get_client();

    // 1. Try to get data from Redis cache
    $cached_data = $redis_client->get( $cache_key );

    if ( $cached_data ) {
        // Cache hit: unserialize and return
        $posts_data = unserialize( $cached_data );
        // Important: Ensure we return WP_Post objects, not just serialized data.
        // If the cached data is an array of post IDs, we'd need to fetch them here.
        // For simplicity, we'll assume we cached the full post objects (or their essential data).
        // A more robust solution might cache post IDs and reconstruct objects.
        // For this example, let's assume we cached serialized WP_Post objects.
        // If unserialize fails, it returns false.
        if ( $posts_data !== false ) {
            // Re-instantiate WP_Post objects if necessary, or ensure they are properly serialized.
            // A common pattern is to cache an array of post IDs and then fetch them.
            // Let's refine this to cache post IDs for better serialization and retrieval.
            // For now, let's assume we cached an array of post IDs.
            if ( is_array( $posts_data ) ) {
                $posts = array();
                foreach ( $posts_data as $post_id_from_cache ) {
                    $post_object = get_post( $post_id_from_cache );
                    if ( $post_object instanceof WP_Post ) {
                        $posts[] = $post_object;
                    }
                }
                return $posts;
            }
        }
    }

    // 2. Cache miss: Fetch data from WordPress
    $terms = wp_get_post_terms( $post_id, $taxonomy_slug, array( 'fields' => 'ids' ) );

    if ( is_wp_error( $terms ) || empty( $terms ) ) {
        return array(); // No terms, no related posts
    }

    $args = array(
        'post_type'      => 'post', // Or your custom post type
        'posts_per_page' => $limit,
        'post_status'    => 'publish',
        'tax_query'      => array(
            array(
                'taxonomy' => $taxonomy_slug,
                'field'    => 'term_id',
                'terms'    => $terms,
            ),
        ),
        'post__not_in'   => array( $post_id ), // Exclude the current post
        'fields'         => 'ids', // Fetch only post IDs for efficient caching
    );

    $query = new WP_Query( $args );
    $post_ids = $query->have_posts() ? $query->posts : array();

    // 3. Store data in Redis cache
    if ( ! empty( $post_ids ) ) {
        // Cache the array of post IDs
        $redis_client->setex( $cache_key, $cache_duration, serialize( $post_ids ) );
    }

    // 4. Return the fetched posts (as WP_Post objects)
    $posts = array();
    foreach ( $post_ids as $post_id_from_query ) {
        $post_object = get_post( $post_id_from_query );
        if ( $post_object instanceof WP_Post ) {
            $posts[] = $post_object;
        }
    }

    return $posts;
}

// In your Twig template, you would now call the cached function:
// {% set related_posts = Timber\Timber::get_posts( function('get_related_posts_by_taxonomy_cached', _context.post.ID, 'genre', 5, 7200) ) %}
// Note: Timber::get_posts expects an array of post objects or an ID.
// If get_related_posts_by_taxonomy_cached returns an array of WP_Post objects,
// Timber can directly use it. If it returns IDs, Timber will fetch them.
// For optimal performance, ensure the function returns WP_Post objects or IDs that Timber can efficiently process.
// The example above returns WP_Post objects.

Cache Invalidation Strategies

Caching is only effective if the data remains fresh. For taxonomy-related queries, invalidation is key. Here are common strategies:

On Post Save/Update: When a post is saved or updated, its associated taxonomy terms might change, affecting related post queries. Hook into `save_post` or `wp_insert_post` to clear relevant cache entries.
On Term Save/Update: Changes to terms themselves (e.g., renaming) or their relationships to posts necessitate cache invalidation. Hook into `created_term`, `edited_term`, `delete_term`.
Manual Invalidation: Provide an admin interface or a WP-CLI command to clear specific cache keys or all taxonomy-related caches.

Implementing Cache Invalidation Hooks

Here’s an example of how to invalidate cache entries when a post is saved. This is a simplified example; a production system would need more sophisticated key management.

/**
 * Invalidate related posts cache when a post is saved.
 */
function invalidate_related_posts_cache_on_save( $post_id ) {
    // Only proceed if this is not an autosave and the post type is relevant
    if ( defined( 'DOING_AUTOSAVE' ) && DOING_AUTOSAVE ) {
        return;
    }
    if ( ! current_user_can( 'edit_post', $post_id ) ) {
        return;
    }

    // Check if Redis is available
    if ( ! RedisCache::is_available() ) {
        return;
    }

    $redis_client = RedisCache::get_client();

    // This is a naive approach: it clears ALL related posts caches.
    // A better approach would be to identify WHICH caches to clear based on the post's updated terms.
    // For a more granular approach, you'd fetch the terms for the post,
    // then iterate through known taxonomies and generate keys to delete.

    // Example of clearing a specific key if you know the taxonomy and limit:
    // $taxonomy_slug = 'genre'; // Example taxonomy
    // $limit = 5;
    // $cache_key_params = array( 'post_id' => $post_id, 'taxonomy_slug' => $taxonomy_slug, 'limit' => $limit );
    // $cache_key = RedisCache::generate_key( 'related_posts', $cache_key_params );
    // $redis_client->del( $cache_key );

    // For a broader invalidation, consider using Redis KEYS (with caution on production)
    // or a pattern-based deletion if your Redis client supports it.
    // A safer approach is to maintain a list of keys to invalidate in a separate Redis set.

    // Example: Scan for keys matching a pattern (use with extreme caution on large Redis instances)
    // $pattern = 'wp_cache:related_posts:*';
    // $keys_to_delete = $redis_client->keys( $pattern );
    // if ( ! empty( $keys_to_delete ) ) {
    //     $redis_client->del( $keys_to_delete );
    // }

    // A more robust method: Maintain a list of keys associated with a post ID or term ID.
    // When a post is updated, fetch its terms, then fetch the list of cache keys associated with those terms/post.
    // For instance, a Redis Set named 'post_related_cache_keys:' . $post_id
    // On save:
    // 1. Get current terms for $post_id.
    // 2. For each term, get its associated cache keys from a Set (e.g., 'term_related_cache_keys:' . $term_id).
    // 3. Delete those keys from Redis.
    // 4. Add new cache keys to the Sets if applicable.
}
add_action( 'save_post', 'invalidate_related_posts_cache_on_save', 10, 1 );

/**
 * Invalidate cache when a term is created, edited, or deleted.
 */
function invalidate_taxonomy_cache_on_term_change( $term_id, $tt_id, $taxonomy ) {
    if ( ! RedisCache::is_available() ) {
        return;
    }
    $redis_client = RedisCache::get_client();

    // This is a broad invalidation. For granular, you'd need to track which posts use this term.
    // A common pattern is to clear caches related to the specific taxonomy.
    // For example, if 'genre' taxonomy terms change, clear all 'related_posts' caches that use 'genre'.
    // This requires a more complex key structure or a separate index.

    // Example: Clear all caches that might be affected by a taxonomy change.
    // This is highly inefficient and should be avoided in production.
    // A better approach is to use a Redis Set to track keys associated with terms.
    // For example, a Set named 'term_cache_keys:' . $taxonomy . ':' . $term_id
    // When a term is updated, iterate through this set and delete the keys.

    // For demonstration, let's assume a simple pattern-based deletion for a specific taxonomy.
    // This is still risky on large datasets.
    // $pattern = 'wp_cache:related_posts:*' . $taxonomy . '*'; // This pattern is too broad.
    // A more targeted pattern would be needed.

    // A more practical approach:
    // When a term is updated, find all posts associated with that term.
    // For each post, invalidate its specific related posts cache.
    // This can be done by querying posts by term, then for each post,
    // generating and deleting its cache keys. This can be resource-intensive.

    // A common compromise: Invalidate caches for a specific taxonomy if its structure changes significantly.
    // For example, if a term is deleted, any cache entry referencing that term is now stale.
    // A robust solution involves a mapping of terms to cache keys.
}
add_action( 'created_term', 'invalidate_taxonomy_cache_on_term_change', 10, 3 );
add_action( 'edited_term', 'invalidate_taxonomy_cache_on_term_change', 10, 3 );
add_action( 'delete_term', 'invalidate_taxonomy_cache_on_term_change', 10, 3 );

Advanced Considerations and Best Practices

Cache Key Granularity: Design cache keys carefully. Include all relevant parameters that define the query’s uniqueness (post ID, taxonomy, limit, user roles if applicable, etc.). Use a consistent prefix (e.g., `wp_cache:`).
Serialization: `serialize()` and `unserialize()` are generally safe for caching WordPress objects and arrays. However, be mindful of object serialization if your objects contain complex, non-serializable resources. Caching post IDs and then fetching `WP_Post` objects on cache hit is often more efficient and less prone to serialization issues.
Cache Duration (`TTL`): Set appropriate Time-To-Live (TTL) values. For taxonomy data that changes infrequently, a longer TTL (e.g., hours) is suitable. For dynamic content, shorter TTLs or no caching might be necessary.
Redis Memory Management: Configure Redis with appropriate memory limits and eviction policies (e.g., `allkeys-lru`) to prevent it from consuming all available RAM.
Monitoring: Regularly monitor Redis performance, memory usage, and cache hit/miss ratios. Tools like RedisInsight or `redis-cli monitor` can be invaluable.
Error Handling: Implement robust error handling for Redis connections and operations. Gracefully degrade to non-cached behavior if Redis is unavailable.
Security: If Redis is exposed externally, ensure it’s protected by strong passwords and firewall rules.
Timber Integration: Ensure your Timber setup correctly calls the cached functions. Timber’s `Timber::get_posts()` can often directly consume an array of `WP_Post` objects returned by your cached function.

Conclusion

By implementing a native Redis caching layer for custom taxonomy queries within Timber/Twig, you can achieve significant performance gains, reduce database load, and provide a snappier user experience. The key lies in thoughtful cache key design, effective invalidation strategies, and robust error handling. This approach transforms potentially slow, repetitive database operations into near-instantaneous cache lookups, making your high-volume WordPress site more scalable and performant.