Byte Rot: CacheCow 0.4 released: new features and a breaking change

Monday, 18 March 2013

CacheCow 0.4 released: new features and a breaking change

Version 0.4 is out and with it a couple of features such as attribute-based cache control and cache refresh (see below). For the first time I felt that I have got to write a few notes about this version, least of which because of one breaking change in this release - although it is not likely to break your code as I explain further. Changes have been in the server components.

I and other contributors have been working on CacheCow for the last 8 months. I thought with a couple of posts I have explained the usage of CacheCow. But I now feel that with concept counts increasing, I need to start a series on CacheCow. Before doing that I am going to explain new concepts and the breaking change.

Breaking change

The breaking change was a change in the signature of CacheControlHeaderProvider from Func<HttpRequestMessage, CacheControlHeaderValue> to Func<HttpRequestMessage, HttpConfiguration, CacheControlHeaderValue> to accept HttpConfiguration.

If you have provided your own CacheControlHeaderProvider, you need to provide HttpConfiguration as well - which should be very easy to fix whether web-host or self-host.

Cache Control Policy

So defining cache policy against resource have been by setting the value of CacheControlHeaderProvider which you would define whether a resource is cacheable and if it is, what is the expiry (and other related stuff):

public Func<HttpRequestMessage, HttpConfiguration, CacheControlHeaderValue> CacheControlHeaderProvider { get; set; }

So by default CacheCow sets Func to return a header value for private caching with immediate expiry for all resources:

CacheControlHeaderProvider = (request, cfg) => new CacheControlHeaderValue()
{
 Private = true,
 MustRevalidate = true,
 NoTransform = true,
 MaxAge = TimeSpan.Zero
};

Immediate expiry actually means that the client can use the expired resource as long as it validates the resource using a conditional GET - as explained before here.

But what if you want to individualise cache policy for each resource? We could use per-route handlers but that is not ideal and generally it depends on the resource organisation approach. I have explained in my previous post that resource organisation is one of the areas that needs to be looked at. But this is not within the scope of CacheCow. We are looking into solving this as part of another project while ASP.NET team are also looking into this. So I have decoupled the resource organisation project from CacheCow.

Having said that, in the meantime, I am going to provide some help with doing cache policy set up less painful. This means that CacheCow will come with a few pre-defined functions that help you with defining your cache control policy.

Good news! Cache policy definition using attributes

So now you can define your cache policy against your actions or controllers or both - although action attribute always takes precedence over controller. Using the popular ASP.NET Web API sample:

    [HttpCacheControlPolicy(true, 100)]
    public class ValuesController : ApiController
    {

        public IEnumerable<string> Get()
        {
            return new[] { "cache", "cow" };
        }

        [HttpCacheControlPolicy(true, 120)]
        public string Get(int id)
        {
            return "cache cow... mooowwwww";
        }

So GET call to the first action (/api/Values) will have a max-age of 100 while GET to the second action (e.g. /api/Values/1) will return a max-age of 120.

In order to set this up, all you have to do is to set the CacheControlHeaderProvider property of your CachingHandler to GetCacheControl method of an instance of AttributeBasedCacheControlPolicy:

cachingHandler.CacheControlHeaderProvider = new AttributeBasedCacheControlPolicy(
 new CacheControlHeaderValue()
  {
   NoStore = true
  }).GetCacheControl;

So in above we have passed a default caching policy of no-caching. This table defines which attribute value (or default provided in the constructor) is used:

Cache Refresh Policy

CacheCow works best when HTTP API is actually a REST API. In other words, it uses uniform interface (i.e. HTTP Verbs) to modify resources and this means that the caching handler will get the opportunity to invalidate and remove the cache when POST, PUT, DELETE or PATCH is used.

Problem is commonly HTTP API sits on the top of a legacy system where it has not control over modifications of resources and acts as a data provider. In such a case, the API will not be notified on resource changes and application will be responsible for removing cache metadata directly on the EntityTagStore used. And this is not always possible.

I am providing a solution for defining a time based cache refresh policy using attributes in a very similar fashion to Cache Control Policy - even the above table applies. Removal of items from cache store on the server happens upon the first request after the refresh interval has passed not immediately after interval. So we add the refresh policy provider:

cachingHandler.CacheRefreshPolicyProvider = new AttributeBasedCacheRefreshPolicy(TimeSpan.FromSeconds(5 * 60 * 60))
    .GetCacheRefreshPolicy;

We have defined 5 hour refresh policy as default. And we override using controller or action attributes.

Future posts

As promised, next few posts will be a CacheCow walk-through.

21 comments:

Unknown2 April 2013 at 22:14
Hi aliostad. I really appreciated your work. I was working on Asp.NET Web API Service, particularly on the cache part, and I found your implementation really helpful.

I would be very grateful if you could help me with a question. I saw in the CachingHandler class that the eTag construction is based on a random Guid, but those eTags would be lost if you restart the service (if you don't use a db to store them). I'd like to know if would it be possible to generate the eTag based on the response payload or if there is a reason for that generation to be made the way it is now.

I would be very grateful if you could help me with this doubt.

Congratulations for you nice work!

Guilherme
ReplyDelete
Replies
aliostad3 April 2013 at 19:56
Hi Guilherme,

Default implementation of eTag store is an InMemory one but you could use persistent ones such as Memcached, SQL Server, Redis, etc. So your best bet is to use a persistent one so it does not be lost. It seems that you have already taken a note of that.

I have been thinking of adding a feature to generate eTag based on the content. This is not there yet and might be a while until done.

Your best option is to do this:

1- Define a custom header name such as x-cachecow-content-hash
2- Pass that when as one of the varyByHeaders headers when you initialise the CachingHandler
3- Generate the content hash and add the header to REQUEST (and not RESPONSE)
4- Create a Func and set to the value of ETagValueGenerator to read from the value of the headers passed.

Any problems let me know.

ReplyDelete
Replies
Jay Dean4 June 2014 at 19:30
I am trying to by-pass caching on the server for some routes in my Web-Api project however when using an attribute based caching policy, I noticed based on the HTTP responses that the policy works but the SQL Server store is still updated with the cache state info. How can I prevent the CacheState table from adding and updating for resources I do not want cached. I attempted to use the HTTPCacheControlPolicy with the cacheControlHeaderValueFactory parameter but it is not entirely clear to me how this should be implemented. Can you please shed some light as to how I can go about addressing this issue. thanks.
ReplyDelete
Replies
Anonymous19 July 2014 at 21:32
Hi. Going back to the point you make in the post about legacy systems changing state and this not being picked up by the Handler, is it then OK (Best Practice) to manually remove the data from the store? (SQL, Memcache, whatever) ?
ReplyDelete
Replies
Mulk8728 July 2014 at 19:02
Hi. Based on the information in this article I added the following to my CachingHander created in my Application_Start (Global.asax.cs).

var cachecow = new CachingHandler(GlobalConfiguration.Configuration);

cachecow.CacheControlHeaderProvider = new AttributeBasedCacheControlPolicy(
new CacheControlHeaderValue()
{
NoCache = true
}).GetCacheControl;

I now get the following error ,"exceptionMessage":"Multiple actions were found that match the request:...

Any advice in troubleshooting/resolving would be appreciated. Thanks
ReplyDelete
Replies

Add comment

Note: only a member of this blog may post a comment.