โก Caching
This guide explains how to work with caching in Django Structured JSON Field.
๐ Overview
The field implements a caching mechanism to optimize queries during serialization and prevent multiple identical queries, especially when dealing with relationships such as ForeignKey
and QuerySet
fields.
๐ Current Cache Features
- โ
Shared cache between
ForeignKey
fields andQuerySet
fields - โ Shared cache through nested schemas
- โ Shared cache through nested lists of schemas
- โณ Shared cache between all
StructuredJSONFields
in the same instance - โณ Shared cache between multiple instances of the same model
- โณ Cache invalidation mechanism
๐ง Configuration
๐ Basic Settings
Configure caching in your settings.py
:
STRUCTURED_FIELD = {
'CACHE': {
'ENABLED': True, # Default is True
'SHARED': False # โ ๏ธ EXPERIMENTAL: enables thread-shared cache
},
}
๐ Cache Options
- ENABLED: Enable/disable caching globally (default:
True
) - SHARED: Enable thread-shared cache (experimental feature, default:
False
)
๐ง How Caching Works
The caching system in Django Structured JSON Field is designed to optimize query performance when working with related objects:
- When a related object is first accessed, it's fetched from the database and stored in the cache
- Subsequent accesses to the same related object will use the cached version instead of making another database query
- The cache is maintained during the serialization process, improving performance especially with deeply nested structures
๐ข Without Caching (Multiple Queries)
When caching is disabled, each access to a related field requires a separate database query:
# Without caching, this makes 6 separate database queries
instance = TestModel.objects.first()
instance.structured_data.fk_field.name
instance.structured_data.child.fk_field.name
instance.structured_data.child.child.fk_field.name
instance.structured_data.child.child.child.fk_field.name
instance.structured_data.child.child.child.child.fk_field.name
instance.structured_data.child.child.child.child.child.fk_field.name
๐ With Caching (Single Query)
When caching is enabled, related objects are fetched with a single optimized query:
# With caching, this makes just 1 database query for all related objects
instance = TestModel.objects.first()
instance.structured_data.fk_field.name
instance.structured_data.child.fk_field.name
instance.structured_data.child.child.fk_field.name
instance.structured_data.child.child.child.fk_field.name
instance.structured_data.child.child.child.child.fk_field.name
instance.structured_data.child.child.child.child.child.fk_field.name
๐ Cache Performance Benefits
The caching system provides significant performance benefits in several scenarios:
1๏ธโฃ Nested Foreign Keys
class ChildSchema(BaseModel):
name: str
fk_field: SomeModel = None
class ParentSchema(BaseModel):
name: str
child: ChildSchema = None
fk_field: SomeModel = None
When accessing multiple foreign keys in a nested structure, the cache ensures efficient queries.
2๏ธโฃ QuerySet Fields
class MySchema(BaseModel):
name: str
qs_field: QuerySet[SomeModel]
When working with QuerySet fields, the cache ensures the related objects are fetched efficiently.
3๏ธโฃ Lists of Nested Schemas
class ChildSchema(BaseModel):
name: str
fk_field: SomeModel = None
class ParentSchema(BaseModel):
name: str
childs: List[ChildSchema] = []
The cache works effectively even with lists of schemas containing relationships.
๐ Cache Behavior in Different Modes
๐ Standard Cache (SHARED=False)
- Cache is maintained per instance
- Optimizes queries within a single instance
- Safe for production use
๐ Shared Cache (SHARED=True)
- Cache is shared across all instances
- Provides maximum optimization but with some risks
- Experimental and not recommended for production without testing
๐ก Best Practices
โ๏ธ Cache Configuration:
- Keep caching enabled in production for performance benefits
- Test thoroughly before enabling shared cache in production
- Consider disabling cache only for debugging purposes
๐ Query Optimization:
- Use appropriate database indexes on related model fields
- Still use
select_related()
when fetching initial models - Consider query patterns in your application design
๐งช Testing:
- Test your application with both cache enabled and disabled
- Verify behavior with both standard and shared cache
- Use profiling tools to measure performance improvements
โ Common Issues and Solutions
๐ Stale Data:
- Cache doesn't automatically update when related objects change
- Refresh objects from database when needed
- Consider application design to minimize cache staleness
๐พ Memory Usage:
- Monitor memory consumption with large datasets
- Be aware of the memory tradeoff for performance
- Consider clearing cache for very large operations
๐ง Testing Different Cache Configurations
You can test different cache configurations by modifying your settings:
# Disable cache entirely (for testing or debugging)
STRUCTURED_FIELD = {
'CACHE': {
'ENABLED': False,
},
}
# Enable standard cache (recommended for production)
STRUCTURED_FIELD = {
'CACHE': {
'ENABLED': True,
'SHARED': False,
},
}
# Enable shared cache (experimental)
STRUCTURED_FIELD = {
'CACHE': {
'ENABLED': True,
'SHARED': True,
},
}
๐ Next Steps
After understanding caching, you might want to explore:
- ๐ REST Framework Integration for API optimization
- ๐ Relationships for complex data structures
- ๐งฐ Admin Integration for admin interface optimization