issue_comments: 1008537194

html_url: https://github.com/simonw/sqlite-utils/issues/364#issuecomment-1008537194
issue_url: https://api.github.com/repos/simonw/sqlite-utils/issues/364
id: 1008537194
node_id: IC_kwDOCGYnMM48HQ5q
user: 9599
created_at: 2022-01-10T04:29:53Z
updated_at: 2022-01-10T04:31:29Z
author_association: OWNER

After a bunch of debugging with print() statements it's clear that the problem isn't with when things are committed or the size of the batches - it's that the data sent to standard input is all being processed in one go, not a line at a time.

I think that's because it is being buffered by this: https://github.com/simonw/sqlite-utils/blob/d2a79d200f9071a86027365fa2a576865b71064f/sqlite_utils/cli.py#L759-L770

The buffering is there so that we can sniff the first few bytes to detect if it's a CSV file - added in 99ff0a288c08ec2071139c6031eb880fa9c95310 for #230. So maybe for non-CSV inputs we should disable buffering?
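
To make the idea concrete, here is a rough sketch of what "only buffer when sniffing" could look like. This is not the code in sqlite_utils/cli.py; `iter_rows`, its `sniff` flag, and the buffer sizes are hypothetical names and values chosen for illustration, and it assumes the non-CSV input is newline-delimited JSON. Whether skipping the wrapper actually fixes the behaviour described above would still need testing against a real pipe.

```python
import io
import json


def iter_rows(binary_stream, sniff=False, encoding="utf-8"):
    """Yield one parsed row per input line (hypothetical helper).

    The look-ahead buffer is only added when sniffing is requested, so
    non-CSV input is read straight from the underlying stream.
    """
    if sniff:
        # Assumed sizes: wrap the stream so the first bytes can be peeked
        # at without consuming them.
        buffered = io.BufferedReader(binary_stream, buffer_size=4096)
        head = buffered.peek(2048)  # format detection would inspect `head`
        binary_stream = buffered

    # Iterating a binary stream yields one line at a time.
    for line in binary_stream:
        line = line.strip()
        if line:
            yield json.loads(line.decode(encoding))


if __name__ == "__main__":
    sample = io.BytesIO(b'{"id": 1}\n{"id": 2}\n')
    for row in iter_rows(sample):
        print(row)
```

Because `iter_rows` is a generator, each row is handed to the caller as soon as its line has been read, rather than after the whole input has been consumed.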

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
issue: 1095570074