{"html_url": "https://github.com/simonw/sqlite-utils/pull/333#issuecomment-979345527", "issue_url": "https://api.github.com/repos/simonw/sqlite-utils/issues/333", "id": 979345527, "node_id": "IC_kwDOCGYnMM46X6B3", "user": {"value": 2118708, "label": "Florents-Tselai"}, "created_at": "2021-11-25T16:31:47Z", "updated_at": "2021-11-25T16:31:47Z", "author_association": "NONE", "body": "Thanks for your reply @simonw . Tbh, my first attempt was actually the `parquet-to-sqlite` package but I already had Makefiles that relied on `SQLite-utils` and it was less intrusive to my workflow. Maybe I'll revisit that decision.\r\nFYI: there's a `[sqlite-parquet-vtable](https://github.com/cldellow/sqlite-parquet-vtable)`\r\n\r\nI don't think plugins make much sense either. Probably defeats the purpose of simplicity: simple database along with a pip-able package.", "reactions": "{\"total_count\": 0, \"+1\": 0, \"-1\": 0, \"laugh\": 0, \"hooray\": 0, \"confused\": 0, \"heart\": 0, \"rocket\": 0, \"eyes\": 0}", "issue": {"value": 1039037439, "label": "Add functionality to read Parquet files."}, "performed_via_github_app": null} {"html_url": "https://github.com/simonw/sqlite-utils/issues/173#issuecomment-956041692", "issue_url": "https://api.github.com/repos/simonw/sqlite-utils/issues/173", "id": 956041692, "node_id": "IC_kwDOCGYnMM44_Anc", "user": {"value": 2118708, "label": "Florents-Tselai"}, "created_at": "2021-11-01T08:42:24Z", "updated_at": "2021-11-01T08:42:24Z", "author_association": "NONE", "body": "> I know how to build this for CSV and TSV - I can read them via a file wrapper that counts how many bytes it has seen.\r\n> \r\n> Not sure how to do it for JSON though. Maybe I could provide it just for newline-delimited JSON? Again I can measure progress based on how many bytes have been read.\r\n\r\nI was thinking about this, while inserting a stream of ~40M line-delimited json docs. Wouldn't a `--total-expected` flag work ? \n\r\n\r\nThat's [how tqdm does it](https://github.com/tqdm/tqdm/blob/fc69d5dcf578f7c7986fa76841a6b793f813df35/tqdm/std.py#L366)", "reactions": "{\"total_count\": 0, \"+1\": 0, \"-1\": 0, \"laugh\": 0, \"hooray\": 0, \"confused\": 0, \"heart\": 0, \"rocket\": 0, \"eyes\": 0}", "issue": {"value": 707478649, "label": "Progress bar for sqlite-utils insert"}, "performed_via_github_app": null} {"html_url": "https://github.com/simonw/sqlite-utils/issues/248#issuecomment-954303095", "issue_url": "https://api.github.com/repos/simonw/sqlite-utils/issues/248", "id": 954303095, "node_id": "IC_kwDOCGYnMM444YJ3", "user": {"value": 2118708, "label": "Florents-Tselai"}, "created_at": "2021-10-28T23:46:47Z", "updated_at": "2021-10-28T23:46:47Z", "author_association": "NONE", "body": "@mhalle maybe you can try out #333 ? ", "reactions": "{\"total_count\": 0, \"+1\": 0, \"-1\": 0, \"laugh\": 0, \"hooray\": 0, \"confused\": 0, \"heart\": 0, \"rocket\": 0, \"eyes\": 0}", "issue": {"value": 836829560, "label": "support for Apache Arrow / parquet files I/O"}, "performed_via_github_app": null}