Skip to content

Extracts emails and attachments saved in Microsoft Outlook's .msg files

License

Notifications You must be signed in to change notification settings

username13107/msg-extractor

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

msg-extractor

Extracts emails and attachments saved in Microsoft Outlook's .msg files

The python script ExtractMsg.py automates the extraction of key email data (from, to, cc, date, subject, body) and the email's attachments.

To use it

  python ExtractMsg.py example.msg

This will produce a new folder named according to the date, time and subject of the message (for example "2013-07-24_0915 Example"). The email itself can be found inside the new folder along with the attachments.

The script uses Philippe Lagadec's Python module that reads Microsoft OLE2 files (also called Structured Storage, Compound File Binary Format or Compound Document File Format). This is the underlying format of Outlook's .msg files.

The script was built using Peter Fiskerstrand's documentation of the .msg format.

There are at least two major issues with version 0.1. The first is that email messages can be embedded in .msg files---the script doesn't like them at all and will dump a 'raw' directory instead of the normal output. This directory will contain all you need from the email, but in a less-than-ideal form. The second issue is that the script cannot extract the date of sent emails (as opposed to received emails).

If you have any questions feel free to contact me, Matthew Walker, at mattgwwalker at gmail.com.

About

Extracts emails and attachments saved in Microsoft Outlook's .msg files

Resources

License

Stars

Watchers

Forks

Packages

No packages published