Chem4Word – Chemistry Add-In for Microsoft Word
The Chemistry Add-In for Microsoft Word (Chem4Word) is a chemistry-aware add-in for Microsoft Word which is sponsored and supported by the .NET Foundation (https://dotnetfoundation.org).
It works with Office (Word) 2010 or greater, running on Windows 7 or greater.
Microsoft Office Extensibility
Starting with Microsoft Word 2007 (or later) users are given the option to store documents as Office Open XML files with a “.docx” extension. Such documents are no longer stored as binary files, but are extensible mark-up language (XML) files describing how the document is laid out, this XML data is compressed using the industry standard ZIP compression algorithm before being saved with the “.docx” extension. The “x” of “.docx” signifies that the files contain compressed xml not the older binary data.
Microsoft Office allows a programmer to use the Microsoft Office extensibility layer called “Visual Studio Tools for Office” (VSTO) to write a .net program which will enhance the capabilities of office applications, such as Microsoft Word. They do this by writing a special program called an add-in. Once an add-in is installed it runs when Microsoft Word is started. Typically an add-in will create an extra Microsoft Office Ribbon to allow the user to interact with the document using its functions.
Microsoft Word add-ins can store complete XML documents called Custom Xml Parts inside a special area of a document. Microsoft Word also has containers called Content Controls, which can store various types of content such as drawings.
How the Chem4Word Add-In works
The Chem4Word add-in stores the chemical structures as Chemical Mark-up Language (CML) in the aforementioned Custom XML Parts then renders the structure inside a Custom Control using Drawing ML. The Chem4Word add-in also allows the user to depict the structure by one of its textual descriptors. When you edit a chemistry structure and save the results, all of its linked visualisations are also updated. A ChemSpider web service is then used to generate a code known as an InChiKey, this uniquely identifies the structure, the InChiKey is stored as an additional textual descriptor element inside the CML.
One big advantage of having the chemistry embedded as machine readable XML is that the data is very easily imported into other information systems such as SharePoint, allowing unique chemical structures to be catalogued and search.
The Chem4Word add-in allows the user to interact with chemistry in a document as follows
- Edit / draw your structures using an embedded open source chemical structure editor called ChemDoodle Web Sketcher.
- Edit textual descriptors for a structure.
- Import chemical structures from web services such as PubChem and Opsin.
- Import files in CML or MDL Molfile format.
- Export drawn or imported structures in CML or MDL Molfile format. You can use this option to copy drawings to other documents of share them with colleagues.
A screen shot of Microsoft Office 2016 showing the Chem4Word ribbon and an embedded structure is shown below.