Metadata-Version: 1.1
Name: Carpenter
Version: 1.0.0
Summary: A utility library which repairs and analyzes tabular data
Home-page: https://github.com/OpenGov/carpenter
Author: Matthew Seal
Author-email: mseal@opengov.us
License: LGPL 2.1
Download-URL: https://github.com/OpenGov/carpenter/tarball/v1.0.0
Description: Carpenter

        =========

        

        A Python repository which repairs and analyzes tabular data

        

        Description

        -----------

        

        This module provides the capability to extract and repair blocks of data

        from 2D tables. These blocks can then be individually processed,

        stitched together, or filtered as needed by a particular program.

        

        Autoconversion of cells, along with a multi-tier flagging system for

        each magnitude of change, allows for a wide variety of error handling.

        Additionally, missing titles can be repaired from surrounding cells in

        order to generate complete blocks from implied headings.

        

        Dependencies

        ------------

        

        -  allset

        -  pydatawrap

        

        Setup

        -----

        

        Installation

        ~~~~~~~~~~~~

        

        From source:

        

        ::

        

            python setup.py install

        

        From pip:

        

        ::

        

            pip install carpenter

        

        Features

        --------

        

        -  Block detection

        -  Title repairing

        -  Tunable cell conversions

        -  Column re-orienting

        

        Navigating the Repo

        -------------------

        

        carpenter

        ~~~~~~~~~

        

        The top-level, front-facing objects and functions

        

        carpenter/blocks

        ~~~~~~~~~~~~~~~~

        

        The implementation details of block processing

        

        carpenter/regex

        ~~~~~~~~~~~~~~~

        

        The regex suite used to perform cell type identification

        

        tests

        ~~~~~

        

        All unit tests for the repo.

        

        Language Preferences

        --------------------

        

        -  Google Style Guide

        -  Object Oriented (with a few exceptions)

        

        TODO

        ----

        

        -  Refactor top-level functionality

        -  Add new usable functions

        -  Separate some flagging from block iteration code

        

        Author

        ------

        

        Author(s): Matthew Seal

        

        © Copyright 2013, `OpenGov <http://opengov.com>`__

        
Keywords: tables,data,analysis,extraction
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Topic :: Utilities
Classifier: License :: OSI Approved :: GNU Lesser General Public License v2 (LGPLv2)
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 2 :: Only
