Metadata-Version: 1.0
Name: transmogrify.htmlcontentextractor
Version: 1.0b1
Summary: This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Home-page: http://github.com/djay/transmogrify.htmlcontentextractor
Author: Dylan Jay
Author-email: software@pretaweb.com
License: GPL
Description: Introduction
        ============
        
        transmogrify.htmlcontentextractor
        This blueprint extracts out title, description and body from html
        either via xpath or by automatic cluster analysis
        
        Changelog
        =========
        
        1.0b1 (2010-11-03)
        ------------------
        
        - ignore already found items. better debug
        ["Dylan Jay"]
        
        - skip templates if item already parsed
        ["Dylan Jay"]
        
        - print automaticly found XPaths
        ["Dylan Jay"]
        
        - make text fields strip tail text
        ["Vitaliy Podoba"]
        
        1.0dev (2010-03-22)
        -------------------
        
        - split the auto templatefinder out to it's own blueprint
        ["Dylan Jay"]
        
        
        
        
Keywords: transmogrifier blueprint funnelweb source plone import conversion microsoft office
Platform: UNKNOWN
Classifier: Programming Language :: Python
Classifier: Topic :: Software Development :: Libraries :: Python Modules
