Thursday, 20 December 2012

Historical Debugging (Part 3) - IntelliTrace

When I first saw IntelliTrace introduced in Visual Studio 2010 I was initially disappointed and passed it by.  Sure, it was petty cool to make this part of your ALM ecosystem.  For example, whenever a test within a Microsoft Test Manager session or a unit test during a CI build failed, an iTrace file could then be stored on the TFS server against the failure and pulled down by a developer to investigate.  For me though, this wasn't where the functionality had the most potential benefit.  With internal testing, we already know the boundaries and set the parameters to ensure the test will fail.  They are written right there in the test.  The real benefit for me is with identifying the unknown cause of a problem within a production environment.  This is exactly what Microsoft gave us with Visual Studio 2012 and the new IntelliTrace Collector.

So what is IntelliTrace?  Well, it's like memory dump analysis on steroids.  However, unlike dump analysis where you can only see the state for frames within the stack, i.e. method entry and exit points, IntelliTrace can be configured to collect data at a far greater verbosity, down to the individual call level.  This provides a debugging experience much closer to that of live debugging except that the state information has already been collected and you can't purposely, or indeed accidentally modify it.

Another great benefit of IntelliTrace, when you have it configured to do so, is the ability to collect gesture information.  This means you no longer need to ask the tester or client for reproduction steps as all the information is right there within the .iTrace file.

IntelliTrace can also be incredibly useful for debugging applications that have a high cyclomatic complexity due to the client reconfigurability of the product.  No longer do you need to spend an enormous amount of time and expense trying to get an environment set up to match that of the client's configuration; if it's even possible that is.  As we are performing historical debugging against an iTrace file that has already collected the information pertaining to the bug, we can jump straight in and do what we do best.

One of the biggest criticisms people have of IntelliTrace however is the initial cost.  The ability to debug against a collected iTrace file is only available in the Ultimate version of Visual Studio.  True, this is an expensive product but I don't think it takes too long before you see a full return on that investment and indeed become better off, both financially and by gaining a strengthened reputation to respond quickly to difficult production issues.  In the very least, I think it makes perfect economic sense to kit out your customer facing maintenance teams with this feature.

Now that IntelliTrace can be used to step through production issues, it has become a formidable tool for the maintenance engineer.  Along side dump file analysis, we have the potential to make the "No repro" issue a thing of the past; only rearing it's head in non-deterministic or load based issues.

2 comments:

  1. Is there not still an issue with sensitive data coming back within the variables of these dump files. Arguably, there could be even more data than that simply brought back via screenshots etc.

    I'll admit I've not read up on it, but is there an easy way to automatically collect this data. I agree that it will prove most useful when trying to fix those edge cases, where you really need to see inside the code to work out what is going on - but those can often occur sporadically on site, and are likely not easily reproducible.

    ReplyDelete
  2. Hi Rob,

    You are correct and it is entirely possible for sensitive data to be present in these trace files but this can be addressed quite easily. While creating a collection plan you can explicitly specify which modules to instrument. As long as the sensitive data has been modelled within a small number of dedicated modules and not passed around freely, simply excluding these modules should omit the sensitive data from the output.

    Now VS2012 has given us the IntelliTrace Collector, we could potentially leave that running in the production environment but there may be performance concerns in doing so depending on the collection plan in use. These iTrace files can also get very large which is why the general recommendation is to turn IntelliTrace on as required rather than to leave it on. A shame since as you say, we may miss out on capturing the data on that sporadic, not easily reproducible issue.

    ReplyDelete