forked from harelba/q

q - Run SQL directly on CSV or TSV files

q - Text as Data

q is a command line tool that allows direct execution of SQL-like queries on CSVs/TSVs (and any other tabular text files).

q treats ordinary files as database tables and supports SQL constructs such as WHERE, GROUP BY, and JOINs. It detects column names and types automatically, and it fully supports multiple character encodings.

q's website is http://harelba.github.io/q/. It contains everything you need to download and start using q immediately.

Installation

Installation is extremely simple.

Instructions for all operating systems are here.

Examples

q "SELECT COUNT(*) FROM ./clicks_file.csv WHERE c3 > 32.3"

ps -ef | q -H "SELECT UID, COUNT(*) cnt FROM - GROUP BY UID ORDER BY cnt DESC LIMIT 3"

Go here for more examples.

Benchmark

I have created a preliminary benchmark that compares q's speed under Python 2 and Python 3, and compares both against textql and octosql.

Your input about the validity of the benchmark and about the results would be greatly appreciated. More details are here.

Contact

Any feedback/suggestions/complaints regarding this tool would be much appreciated. Contributions are most welcome as well, of course.

LinkedIn: Harel Ben Attia

Twitter: @harelba

Email: [email protected]

q on Twitter: #qtextasdata
