• About Jakob Engblom and this blog
Observations from Uppsala Computer Simulation, Virtual Platforms, Embedded Programming, Multicore and More (by Jakob Engblom)

Off-Topic: Analyzing Outlook Mailbox Size

2012 June 2 16:35 / 2 Comments / Jakob

Where I work, we use Exchange as our email server and Outlook as the primary client (at least I do). We also have an email quota that I keep bumping into, since I have a tendency to attract many emails with large attachments like image-happy PowerPoint files or binary code modules to patch things. I am also an extreme user of email folders. My main Outlook account contains some 650 folders, and my offline archive of all my old emails reaches towards 1300, with many 100s of thousands of emails for a total of almost 20 GB. So, pretty extreme.

My problem is: what do I do when the email system tells me (and it is serious, I can attest) that I am close to hitting my quota and that soon email will neither be received nor sent? I want to find the folders that are very large and candidates for some archiving. The answer has eluded me for a long time, until I stumbled upon a 2010 Youtube video: http://www.youtube.com/watch?v=3skJOd4GIak, from “tech-informer.com” (which now looks pretty dead). With some modifications, this solved my problem.

To do this, you need to have SnagIt installed. I do, since SnagIt is an essential tool for my work. Since things have a changed slightly since the video was posted, I will go through it quickly here with some updated screenshots and notes. The versions of software used:

  • SnagIt version 11
  • Outlook 2010
  • Excel 2010

We start in the mostly useless folder size dialog box in Outlook:

As you can see, the name column is too narrow to see the full name of each folder. The box itself cannot be resized to see more data, only the columns can be resized. You cannot sort by either size or total size. A pretty bad example of UI design.

To get the data out of this box to manipulate it, we use the SnagIt feature of text capture along with scrolling capture. In SnagtIt 11, you need to select Text capture along with “all in one”. When you start the capture, click the scroll arrow to capture the entire window contents as text (!).

The resulting capture is not the same as in the instruction video, as it is now tab-delimited rather than space-delimited. The capture looks like this:

I saved this as a .txt file (the default is .rtf), and opened it in Excel. In the Excel import dialog, I selected that the file was delimited by tab characters, and I got a neat table. With text containing “KB” in the size columns which does not sort neatly or allow analysis. To fix this, you have to do a search and replace from “empty space” “K” “B” to nothing. The problem is that the empty space is not a normal space but something else. To solve this, I simply edited a cell and copied the piece of the text that I wanted to replace (note that I select the little space before KB):

Next, do a global replace all:

After this, Excel magically realizes all the number columns are indeed numbers, and you can start manipulating and sort in order of size. The only remaining problem is that all names of folders are unreadable, at least when using deeply nested folder with long names like I do. To solve this, I took another text capture of the mailbox size dialog, with the names column expanded.

Since the resulting capture is just a single long column of names, and it has the same order as the previous capture, I can just copy the text from within the SnagIt editor and paste it into Excel. Thus, finally, I have an analyzable data set.

However, acting on it was more difficult than I thought. There were some pretty big folders, but the majority of the space is spent in some 100 folders each of some 2 to 3 MB each. I also noted that of my 650 folders, some 400 contained no data. Would be nice to be able to hide them in Outlook view. But moving them out of the way to the archive file is a ton of manual work, so I will let them be.

 

 

 

Tweet
Posted in: desktop software / Tagged: analysis, email, Excel, how-to, Outlook, SnagIt, Windows

2 Thoughts on “Off-Topic: Analyzing Outlook Mailbox Size”

  1. Bill Neifert on 2012 June 2 at 17:09 said:

    I used to work diligently to file my email and such like this but with the latest versions of Windows and Outlook, search works so well that I just leave everything in my inbox and create an archive of that every 2-3 months. I tag stuff for follow-up instead of leaving it unfiled if I need to look at it again. Takes a bit to get used to but it saved me a lot of time and hassle especially since I always had questions on where certain emails should be filed anyway. Just my $.02

  2. Jakob on 2012 June 2 at 18:12 said:

    That is certainly an approach to avoid the need for deep data analysis. It does not suit my way of working – indeed, a large portion of my use folders is not as a place to manually sort email into, but rather to separate out various incoming flows automatically. Support queues, internal and external forums, various internal email lists, …, all are sorted on the way in. In that way, I avoid drowning my inbox in things that are not necessarily directed at me. The problem with this is that if some email list suddenly lights up with 10s of MB of big emails, I have to go look at just where it happened.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Post Navigation

← Previous Post
Next Post →

Recent Posts

  • Wind River Blog: Simics 4.8 is Here
  • A Few Electrons too Many
  • Wind River Blog: Visuality NQ CIFS Server on Simics
  • Everything in the Cloud?
  • Wind River Blog: TCF and Simics
  • Off-Topic: Moving Bad Piggies Save Games
  • Two Cores, Four Cores, Eight Cores – Mobile Variety
  • Bliss: Failing to Pivot for Ideology
  • Wind River Blog and Movie: Demo of Simics Debugging
  • Simulation vs Reality in Schlock Mercenary
  • Programming like Lego
  • Does ISA Matter for Performance?
  • Wind River Blog: Debugging Simics using Simics
  • Wind River Blog: Simics and Flying Piggies
  • Dragons can be Useful – when AT Models Make Sense

Categories

  • appearances (30)
  • articles (21)
  • blogging (10)
  • books (6)
  • business issues (31)
  • computer architecture (35)
  • conferences (34)
  • EDA (50)
    • ESL (35)
  • embedded (78)
    • embedded software (57)
    • embedded systeme (50)
  • general research (6)
  • history (32)
    • general history (7)
    • history of computing (26)
  • off-topic (94)
    • biking (5)
    • board games (1)
    • computer games (3)
    • desktop software (35)
    • food and drink (1)
    • funny (12)
    • gadgets (24)
    • Politics (3)
    • popular culture (5)
    • trains (5)
    • transportation (10)
    • travel (10)
    • websites (3)
  • parallel computing (92)
    • multicore computer architecture (51)
    • multicore debug (22)
    • multicore software (65)
  • programming (107)
  • review (8)
  • security (19)
  • teaching (7)
  • testing (9)
  • uncategorized (12)
  • virtual things (129)
    • computer simulation technology (68)
    • virtual machines (17)
    • virtual platforms (98)
    • virtualization (14)
  • Wind River Blog (40)

Tags

ARM blog commentary Cadence Checkpointing clock-cycle models Communications of the ACM computer architecture conference cycle accuracy debugging DML Domain-specific languages embedded freescale G900 heterogeneous homogeneous IBM Intel iPod lego linux mobile phones multicore off-topic office 2007 operating systems p4080 podcast commentary power architecture rant research reverse debugging reverse execution S4D SiCS Multicore days Simics simulation software tools Sun SystemC video virtualization Vista Windows

1

  • F-Secure Blog

Blogs and news

  • Andras Vajda's blog (on multicore)
  • Embedded in Academia (John Regehr)
  • Grant Martin
  • Jack Ganssle
  • My Wind River Blog
  • Security Now podcast
  • Secworks (Joachim Strömbergson)
  • Simon Kågström
  • Synopsys View from the Top
  • Worse Than Failure

Archives

  • May 2013 (2)
  • April 2013 (1)
  • March 2013 (4)
  • February 2013 (1)
  • January 2013 (3)
  • December 2012 (2)
  • November 2012 (2)
  • October 2012 (1)
  • September 2012 (6)
  • August 2012 (4)
  • July 2012 (4)
  • June 2012 (3)
  • May 2012 (4)
  • April 2012 (2)
  • March 2012 (3)
  • February 2012 (1)
  • January 2012 (6)
  • December 2011 (2)
  • November 2011 (3)
  • October 2011 (4)
  • September 2011 (5)
  • August 2011 (4)
  • July 2011 (3)
  • June 2011 (4)
  • May 2011 (7)
  • April 2011 (1)
  • March 2011 (3)
  • February 2011 (5)
  • January 2011 (1)
  • December 2010 (4)
  • November 2010 (3)
  • October 2010 (5)
  • September 2010 (5)
  • August 2010 (5)
  • July 2010 (6)
  • June 2010 (5)
  • May 2010 (3)
  • April 2010 (4)
  • March 2010 (3)
  • February 2010 (4)
  • January 2010 (7)
  • December 2009 (6)
  • November 2009 (6)
  • October 2009 (7)
  • September 2009 (6)
  • August 2009 (7)
  • July 2009 (11)
  • June 2009 (5)
  • May 2009 (10)
  • April 2009 (7)
  • March 2009 (8)
  • February 2009 (9)
  • January 2009 (12)
  • December 2008 (8)
  • November 2008 (9)
  • October 2008 (9)
  • September 2008 (10)
  • August 2008 (13)
  • July 2008 (12)
  • June 2008 (8)
  • May 2008 (9)
  • April 2008 (10)
  • March 2008 (7)
  • February 2008 (8)
  • January 2008 (5)
  • December 2007 (5)
  • November 2007 (7)
  • October 2007 (7)
  • September 2007 (12)
  • August 2007 (9)
  • July 2007 (2)
© Copyright 2013 - Observations from Uppsala
Infinity Theme by DesignCoral / WordPress