martin-conur/dscribe

DScribe 🚧

CLI tool for tabular data description.

Fast summary statistics for tabular data. Works on numerical and categorical fields. Uses the Apache Arrow format to compute basic data insights.

  • Fast
  • Simple
  • Light
  • Reliable

Roadmap:

  • add CSV and TXT read capabilities
  • add summary for numerical data
  • add NaN count
  • add summary for categorical data
  • add Parquet and Excel read capabilities
  • outlier detection (maybe)
  • add date capabilities
  • analyze larger-than-RAM files and sink outputs?
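
To make the first few roadmap items concrete, here is a minimal sketch of a per-column summary with a missing-value count for numerical and categorical fields. It is not DScribe's implementation: it uses only the Python standard library rather than Apache Arrow, and the names `describe_column` and `describe_csv`, as well as the rule that empty cells and the literal "nan" count as missing, are assumptions made for illustration.

```python
import csv
import io
import statistics
from collections import Counter


def describe_column(cells):
    """Summarize one column given its raw string cells (hypothetical helper)."""
    # Assumption: empty cells and the literal "nan" count as missing values.
    present = [c.strip() for c in cells if c.strip() and c.strip().lower() != "nan"]
    missing = len(cells) - len(present)

    # A column is treated as numerical only if every present cell parses as a float.
    try:
        numbers = [float(c) for c in present]
    except ValueError:
        numbers = []

    if numbers:
        return {
            "kind": "numerical",
            "count": len(numbers),
            "missing": missing,
            "mean": statistics.fmean(numbers),
            "std": statistics.stdev(numbers) if len(numbers) > 1 else 0.0,
            "min": min(numbers),
            "max": max(numbers),
        }

    # Otherwise fall back to a categorical summary.
    counts = Counter(present)
    return {
        "kind": "categorical",
        "count": len(present),
        "missing": missing,
        "unique": len(counts),
        "top": counts.most_common(1)[0][0] if counts else None,
    }


def describe_csv(text):
    """Summarize every column of a CSV document passed as a string."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    return {
        name: describe_column([(row.get(name) or "") for row in rows])
        for name in (reader.fieldnames or [])
    }


if __name__ == "__main__":
    sample = "age,city\n30,Lima\n,Lima\n40,Quito\nnan,\n"
    for column, summary in describe_csv(sample).items():
        print(column, summary)
```

This sketch loads the whole file into memory, so it does not address the larger-than-RAM item; a columnar format such as Arrow is better suited to that case.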
