• Recovering a corrupted Word 2007 document

    by  • August 14, 2009 • corruption, Word 2007, xml • 4 Comments

    I just had the fright of my life when opening one of my thesis chapters this morning, to which I got presented with the message:
    corrupted1

    I pressed OK, when I was asked whether I wanted to recover the document:

    corrupted2

    To which I was presented with the earlier error message, with Word making no attempt to recover the document at all:

    corrupted3

    At this point I was concerned I’d lost a whole days work, and what was even more interesting was that the file timestamp was not correct (6:10pm) as I had saved over the file later that evening (9:10pm). I work on a external USB hard disk and try to always ensure I safely remove hardware etc so I was at a bit of loss as to the reason why.

    So, knowing that the .docx is simply a number of XML files and other content within a zip file up I changed the extension from .docx to a .zip, and tried to open it in Windows Explorer. The zip handler in explorer couldn’t open it so I thought the zip container itself must be corrupted — not good!

    Doing a quick search for zip recovery I found a great, free tool called zip-repair from DiskInternals. I provided the tool the corrupted document with the zip extension, which recovered all of the files but some were in better shape than others:

    zip-repair-docx

    I then changed the extension on the recovered file back to a docx and tried to open again in Word 2007. I was greeted initially with the same error message that the file was corrupted, but the recovery attempt succeeded this time.I was presented with the document, content intact.

    Some of figures from Visio was bombed and the formatting of the document fairly non-existent, but it had still preserved headers and inline cross-references including bibliographic references, which were the more time consuming things. So I was able to just cut and paste the relevant material, import the bibliography XML from Jabref and complete data loss was averted.

    Perhaps this is something for Microsoft to look at including in Office 2010 – try to recover the zip file before working on the content? Hope this helps someone and of course: your milage may vary.

    About

    .NET developer at thetrainline.com, previously web developer at MRM Meteorite. Awarded a PhD in misbehaviour detection in wireless ad-hoc networks.A keen C# ASP.net developer bridging the gap with APIs and JavaScript frameworks, one web app at a time.

    http://www.paulkiddie.com

    4 Responses to Recovering a corrupted Word 2007 document

    1. Harriet
      April 20, 2010 at 8:26 pm

      wow, many thanks your “Recovering a corrupted Word 2007 document” post has just saved half my dissertation!

      Best wishes,
      H x

    2. April 21, 2010 at 9:33 am

      Glad it helped :) Know how frustrating it can be to lose work! I havent managed to get hold of Word 2010 RTM to see if they’ve improved the repair facility – heres hoping they do!

      Take care,
      Paul

    3. Bert Leen
      November 15, 2012 at 7:17 am

      Sometimes a word document file from a different version of Office or after being transferred from a different PC, will give you an error stating that the document cannot be read because it is complete problem of word file corruption. Use Kernel for word file repair tool to repair word .doc and .docx file for computer system.

    4. Hardik
      January 28, 2013 at 1:34 pm

      dear sir i tried to recover my losted and correpted file with ur app but initially it says browse box to select CORROUTED file and other browse box to select RECOVERED file……..here is my confusion that what does it mean by RECOVERED FILE? i only have recovered a corruped file which was lost when i formatted my pen drive
      plz help me

    Leave a Reply

    Your email address will not be published. Required fields are marked *