Difference between revisions of "Open Source Data Quality Software"

From Open Risk Manual
(Criteria for inclusion)
 
 
(One intermediate revision by the same user not shown)
Line 22: Line 22:
 
| Java
 
| Java
 
| [https://github.com/datacleaner/DataCleaner github]
 
| [https://github.com/datacleaner/DataCleaner github]
 +
|-
 +
| openRefine
 +
| A tool for working with messy data
 +
| Java
 +
| [https://openrefine.org/ github]
 
|-
 
|-
 
| Validate
 
| Validate
Line 41: Line 46:
 
[[Category:Quantitative Tools]]
 
[[Category:Quantitative Tools]]
 
[[Category:Data Quality]]
 
[[Category:Data Quality]]
 +
[[Category:Open Source]]

Latest revision as of 12:36, 30 November 2019

Open Source Data Quality Software

The table below organizes available Open Source Data Quality software distributions.

Criteria for inclusion

Any open source distribution that is publicly accessible in one of the repositories. For brevity when a repository contains a number of distinct tools, only one link is provided

Open Source Data Quality Software
1. Name 2. Description 3. Language 4. URL
pyeve/cerberus Lightweight, extensible data validation library for Python Python github
Datacleaner Community edition Java github
openRefine A tool for working with messy data Java github
Validate Data cleaning for statistical purposes R github
missingno Missing data visualization module for Python Python github
pyvaru Rule based data validation library for python Python github

Contributors to this article

» Wiki admin