Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Comput Mol Sci
Impact Factor: 8.127

Chemical patent information systems

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

Abstract The chemical structure information in patents remains difficult to access, partly because it is frequently expressed in the form of Markush structures, which can encompass enormous numbers of individual compounds. Early search systems were based on chemical ‘fragment codes’ that have still not been entirely superseded by the ‘topological’ systems developed during the 1980s. There are a number of databases of specific patented structures, which can be searched using standard substructure search software, and the more recently developed ones use automated data mining techniques to extract chemical nomenclature from patent text and translate it into searchable representations. Although some work has been done on automatic reconstruction of searchable Markush structures from patent text, this has proved to be considerably more refractory. A number of alternative approaches to chemical patent searching are being explored, some involving similarity and nearest‐neighbor searching concepts, and some based on both existing curated databases and direct utilization of full‐text patents. In‐house systems, which facilitate integration with other cheminformatics systems, are also under development. These new systems may allow improvements in retrieval performance, especially with regard to search precision. © 2011 John Wiley & Sons, Ltd. WIREs Comput Mol Sci 2011 1 727‐741 DOI: 10.1002/wcms.41 This article is categorized under: Computer and Information Science > Chemoinformatics

Claims 1 and 10 from the US Patent on Viagra, illustrating the use of both Markush structures and specific chemical nomenclature. The third of the specific compounds listed in claim 10 is the one actually marketed as Viagra (Sildenafil).

[ Normal View | Magnified View ]

The hierarchy of MARPAT generic group nodes, with their applicable categories and attributes. For discussion, see Box 1.

[ Normal View | Magnified View ]

A simple Markush structure illustrating the four different types of variability. For discussion, see text.

[ Normal View | Magnified View ]

Related Articles

Representation of chemical structures
Automated systematic nomenclature generation for organic compounds

Browse by Topic

Computer and Information Science > Chemoinformatics

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts