github

This data as json, CSV

html_url	issue_url	id	node_id	user	created_at	updated_at	author_association	body	reactions	issue	performed_via_github_app
https://github.com/simonw/datasette/issues/1101#issuecomment-869812567	https://api.github.com/repos/simonw/datasette/issues/1101	869812567	MDEyOklzc3VlQ29tbWVudDg2OTgxMjU2Nw==	9599	2021-06-28T16:06:57Z	2021-06-28T16:07:24Z	OWNER	Relevant blog post: https://simonwillison.net/2021/Jun/25/streaming-large-api-responses/ - including notes on efficiently streaming formats with some kind of separator in between the records (regular JSON). > Some export formats are friendlier for streaming than others. CSV and TSV are pretty easy to stream, as is newline-delimited JSON. > > Regular JSON requires a bit more thought: you can output a `[` character, then output each row in a stream with a comma suffix, then skip the comma for the last row and output a `]`. Doing that requires peeking ahead (looping two at a time) to verify that you haven't yet reached the end. > > Or... Martin De Wulf [pointed out](https://twitter.com/madewulf/status/1405559088994467844) that you can output the first row, then output every other row with a preceeding comma---which avoids the whole "iterate two at a time" problem entirely.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-869191854	https://api.github.com/repos/simonw/datasette/issues/1101	869191854	MDEyOklzc3VlQ29tbWVudDg2OTE5MTg1NA==	25778	2021-06-27T16:42:14Z	2021-06-27T16:42:14Z	CONTRIBUTOR	This would really help with this issue: https://github.com/eyeseast/datasette-geojson/issues/7	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-755134771	https://api.github.com/repos/simonw/datasette/issues/1101	755134771	MDEyOklzc3VlQ29tbWVudDc1NTEzNDc3MQ==	9599	2021-01-06T07:28:01Z	2021-01-06T07:28:01Z	OWNER	With this structure it will become possible to stream non-newline-delimited JSON array-of-objects too - the `stream_rows()` method could output `[` first, then each row followed by a comma, then `]` after the very last row.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-755133937	https://api.github.com/repos/simonw/datasette/issues/1101	755133937	MDEyOklzc3VlQ29tbWVudDc1NTEzMzkzNw==	9599	2021-01-06T07:25:48Z	2021-01-06T07:26:43Z	OWNER	Idea: instead of returning a dictionary, `register_output_renderer` could return an object. The object could have the following properties: - `.extension` - the extension to use - `.can_render(...)` - says if it can render this - `.can_stream(...)` - says if streaming is supported - `async .stream_rows(rows_iterator, send)` - method that loops through all rows and uses `send` to send them to the response in the correct format I can then deprecate the existing `dict` return type for 1.0.	{ "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-755128038	https://api.github.com/repos/simonw/datasette/issues/1101	755128038	MDEyOklzc3VlQ29tbWVudDc1NTEyODAzOA==	9599	2021-01-06T07:10:22Z	2021-01-06T07:10:22Z	OWNER	Yet another use-case for this: I want to be able to stream newline-delimited JSON in order to better import into Pandas: pandas.read_json("https://latest.datasette.io/fixtures/compound_three_primary_keys.json?_shape=array&_nl=on", lines=True)	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-732544590	https://api.github.com/repos/simonw/datasette/issues/1101	732544590	MDEyOklzc3VlQ29tbWVudDczMjU0NDU5MA==	9599	2020-11-24T02:22:55Z	2020-11-24T02:22:55Z	OWNER	The trick I'm using here is to follow the `next_url` in order to paginate through all of the matching results. The loop calls the `data()` method multiple times, once for each page of results: https://github.com/simonw/datasette/blob/4bac9f18f9d04e5ed10f072502bcc508e365438e/datasette/views/base.py#L304-L307	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032
https://github.com/simonw/datasette/issues/1101#issuecomment-732543700	https://api.github.com/repos/simonw/datasette/issues/1101	732543700	MDEyOklzc3VlQ29tbWVudDczMjU0MzcwMA==	9599	2020-11-24T02:20:30Z	2020-11-24T02:20:30Z	OWNER	Current design: https://docs.datasette.io/en/stable/plugin_hooks.html#register-output-renderer-datasette ```python @hookimpl def register_output_renderer(datasette): return { "extension": "test", "render": render_demo, "can_render": can_render_demo, # Optional } ``` Where `render_demo` looks something like this: ```python async def render_demo(datasette, columns, rows): db = datasette.get_database() result = await db.execute("select sqlite_version()") first_row = " \| ".join(columns) lines = [first_row] lines.append("=" * len(first_row)) for row in rows: lines.append(" \| ".join(row)) return Response( "\n".join(lines), content_type="text/plain; charset=utf-8", headers={"x-sqlite-version": result.first()[0]} ) ``` Meanwhile here's where the CSV streaming mode is implemented: https://github.com/simonw/datasette/blob/4bac9f18f9d04e5ed10f072502bcc508e365438e/datasette/views/base.py#L297-L380	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	749283032