github: issues: 691 rows where state = "open" and type = "issue" sorted by author

691 rows where state = "open" and type = "issue" sorted by author_association

Search:

descending

id	node_id	number	title	user	state	milestone	comments	created_at	updated_at	author_association ▼	body	repo	type	reactions
314834783	MDU6SXNzdWUzMTQ4MzQ3ODM=	219	Expose units in the JSON API?	russss 45057	open		0	2018-04-16T22:04:25Z	2018-04-16T22:04:25Z	CONTRIBUTOR	From #203: it would be nice for the JSON API to (optionally) return columns rendered with units in them - if, for example, you're consuming the JSON to render the rows on a map. I'm not entirely sure how useful this will be though - at the moment my map queries are custom SQL queries (a few have joins in, the rest might be fetching large amounts of data so it makes sense to limit columns fetched). Perhaps the SQL function is a better approach in general.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/219/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
341228846	MDU6SXNzdWUzNDEyMjg4NDY=	343	Render boolean fields better by default	russss 45057	open		1	2018-07-14T11:10:29Z	2018-07-14T14:17:14Z	CONTRIBUTOR	These show up as 0 or 1 because sqlite. I think Yes/No would be fine in most cases?	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/343/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
374953006	MDU6SXNzdWUzNzQ5NTMwMDY=	369	Interface should show same JSON shape options for custom SQL queries	gfrmin 416374	open	Datasette 1.0 3268330	2	2018-10-29T10:39:15Z	2020-05-30T17:24:06Z	CONTRIBUTOR	At the moment the page returning a custom SQL query shows the JSON and CSV APIs, but not the multiple JSON shapes. However, adding the `_shape` parameter to the JSON API URL manually still works, so perhaps there should be consistency in the interface by having the same "Advanced Export" box for custom SQL queries.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/369/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
377155320	MDU6SXNzdWUzNzcxNTUzMjA=	370	Integration with JupyterLab	psychemedia 82988	open		4	2018-11-04T13:57:13Z	2022-09-29T08:17:47Z	CONTRIBUTOR	I just watched a demo video for the JupyterLab Chart Editor which wraps the plotly chart editor app in a JupyterLab panel and lets you open a plotly chart JSON file in that editor. Essentially, it pops an HTML app into a panel in JupyterLab, and I think registers the app as a file viewer for a particular file type. (I'm not completely taken by it, tbh, because it means you can do irreproducible things to the chart definition file, but that's another issue). JupyterLab extensions can also open files from a dialogue as the iframe/html previewer shows: https://github.com/timkpaine/jupyterlab_iframe. This made me wonder about what `datasette` integration with JupyterLab might do. For example, by right-clicking on a CSV file (for which there is already a CSV table view) in the file browser, offer a View / Run as datasette file viewer option that will: run the CSV file through `csvs-to-sqlite`; launch the `datasette` server and display the `datasette` view in a JupyterLab panel. (? Create a new SQLite db for each CSV file and launch each datasette view on a new port? Or have a JupyterLab (session?) SQLite db that stores all `datasette` viewed CSVs and runs on a single port?) As a freebie, the `datasette` API would allow you to run efficient SQL queries against the file eg using using `pandas.read_sql()` queries in a notebook in the same space. Related: JupyterLab extensions docs a cookiecutter for wrting JupyterLab extensions using Javascript a cookiecutter for writing JupyterLab extensions using Typescript tutorial: Let’s Make an xkcd JupyterLab Extension	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/370/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
377166793	MDU6SXNzdWUzNzcxNjY3OTM=	372	Docker build tools	psychemedia 82988	open		0	2018-11-04T16:02:35Z	2018-11-04T16:02:35Z	CONTRIBUTOR	In terms of small pieces lightly joined, I note that there are several tools starting to appear for building generating Dockerfiles and building Docker containers from simpler components such as `requirements.txt` files. If plugin/extensions builders want to include additional packages, then things like incremental builds of composable builds that add additional items into a base `datasette` container may be required. Examples of Dockerfile generators / container builders: openshift/source-to-image (s2i) jupyter/repo2docker stencila/dockter Discussions / threads (via Binderhub gitter) on: - why `repo2docker` not `s2i` - why `dockter` not `repo2docker` - composability in `s2i` Relates to things like: https://github.com/simonw/datasette/pull/280	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/372/reactions", "total_count": 2, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 2, "rocket": 0, "eyes": 0 }
527710055	MDU6SXNzdWU1Mjc3MTAwNTU=	640	Nicer error message for heroku publish name clash	psychemedia 82988	open		1	2019-11-24T14:57:07Z	2019-12-06T07:19:34Z	CONTRIBUTOR	If you try to publish to Heroku using no set name (i.e. the default `datasette` name) and a project already exists under that name, you get a meaningful error report on the first line followed by Py error messages that drown it out: Creating datasette... ! ▸ Name datasette is already taken Traceback (most recent call last): File "/usr/local/bin/datasette", line 10, in <module> sys.exit(cli()) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 764, in __call__ return self.main(args, kwargs) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 717, in main rv = self.invoke(ctx) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 1137, in invoke return _process_result(sub_ctx.command.invoke(sub_ctx)) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 1137, in invoke return _process_result(sub_ctx.command.invoke(sub_ctx)) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 956, in invoke return ctx.invoke(self.callback, ctx.params) File "/usr/local/lib/python3.7/site-packages/click/core.py", line 555, in invoke return callback(args, kwargs) File "/Users/NNNNN/Library/Python/3.7/lib/python/site-packages/datasette/publish/heroku.py", line 124, in heroku create_output = check_output(cmd).decode("utf8") File "/usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/lib/python3.7/subprocess.py", line 411, in check_output kwargs).stdout File "/usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/lib/python3.7/subprocess.py", line 512, in run output=stdout, stderr=stderr) subprocess.CalledProcessError: Command '['heroku', 'apps:create', 'datasette', '--json']' returned non-zero exit status 1. It would be neater if: the Py error message was caught; the report suggested setting a project name using `-n` etc. It may also be useful to provide a command to list the current names that are being used, which I assume is available via a Heroku call?	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/640/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
546073980	MDU6SXNzdWU1NDYwNzM5ODA=	74	Test failures on openSUSE 15.1: AssertionError: Explicit other_table and other_column	jayvdb 15092	open		3	2020-01-07T04:35:50Z	2020-01-12T07:21:17Z	CONTRIBUTOR	openSUSE 15.1 is using python 3.6.5 and click-7.0 , however it has test failures while openSUSE Tumbleweed on py37 passes. Most fail on the cli exit code like py [ 74s] =================================== FAILURES =================================== [ 74s] _________________________________ test_tables __________________________________ [ 74s] [ 74s] db_path = '/tmp/pytest-of-abuild/pytest-0/test_tables0/test.db' [ 74s] [ 74s] def test_tables(db_path): [ 74s] result = CliRunner().invoke(cli.cli, ["tables", db_path]) [ 74s] > assert '[{"table": "Gosh"},\n {"table": "Gosh2"}]' == result.output.strip() [ 74s] E assert '[{"table": "...e": "Gosh2"}]' == '' [ 74s] E - [{"table": "Gosh"}, [ 74s] E - {"table": "Gosh2"}] [ 74s] [ 74s] tests/test_cli.py:28: AssertionError packaging project at https://build.opensuse.org/package/show/home:jayvdb:py-new/python-sqlite-utils I'll keep digging into this after I have github-to-sqlite working on Tumbleweed, as I'll need openSUSE Leap 15.1 working before I can submit this into the main python repo.	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/74/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
642572841	MDU6SXNzdWU2NDI1NzI4NDE=	859	Database page loads too slowly with many large tables (due to table counts)	abdusco 3243482	open		21	2020-06-21T14:23:17Z	2021-08-25T21:59:55Z	CONTRIBUTOR	Hey, I have a database that I save in HTML from couple of web scrapers. There are around 200k+, 50+ rows in a couple of tables, with sqlite file weighing around 600MB. The app runs on a VPS with 2 core CPU, 4GB RAM and refreshing database page regularly takes more than 10 seconds. I was suspecting that counting tables was the culprit, but manually running `select count() from table_name` for the largest table finishes under a second. I've looked at the source code. There's a check for index page for mutable databases larger than 100MB https://github.com/simonw/datasette/blob/799c5d53570d773203527f19530cf772dc2eeb24/datasette/views/index.py#L15 but this check is not performed for database page. I've manually crippled `Database::table_counts` method py async def table_counts(self, limit=10): if not self.is_mutable and self.cached_table_counts is not None: return self.cached_table_counts # Try to get counts for each table, $limit timeout for each count counts = {} for table in await self.table_names(): try: # table_count = ( # await self.execute( # "select count() from [{}]".format(table), # custom_time_limit=limit, # ) # ).rows[0][0] counts[table] = 10 # table_count # In some cases I saw "SQL Logic Error" here in addition to # QueryInterrupted - so we catch that too: except (QueryInterrupted, sqlite3.OperationalError, sqlite3.DatabaseError): counts[table] = None if not self.is_mutable: self.cached_table_counts = counts return counts now the page loads in <100ms. Is it possible to apply size check on database page too? /-/versions output { "python": { "version": "3.8.0", "full": "3.8.0 (default, Oct 28 2019, 16:14:01) \n[GCC 8.3.0]" }, "datasette": { "version": "0.44" }, "asgi": "3.0", "uvicorn": "0.11.5", "sqlite": { "version": "3.22.0", "fts_versions": [ "FTS5", "FTS4", "FTS3" ], "extensions": { "json1": null }, "compile_options": [ "COMPILER=gcc-7.4.0", "ENABLE_COLUMN_METADATA", "ENABLE_DBSTAT_VTAB", "ENABLE_FTS3", "ENABLE_FTS3_PARENTHESIS", "ENABLE_FTS3_TOKENIZER", "ENABLE_FTS4", "ENABLE_FTS5", "ENABLE_JSON1", "ENABLE_LOAD_EXTENSION", "ENABLE_PREUPDATE_HOOK", "ENABLE_RTREE", "ENABLE_SESSION", "ENABLE_STMTVTAB", "ENABLE_UNLOCK_NOTIFY", "ENABLE_UPDATE_DELETE_LIMIT", "HAVE_ISNAN", "LIKE_DOESNT_MATCH_BLOBS", "MAX_SCHEMA_RETRY=25", "MAX_VARIABLE_NUMBER=250000", "OMIT_LOOKASIDE", "SECURE_DELETE", "SOUNDEX", "TEMP_STORE=1", "THREADSAFE=1" ] } }	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/859/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
688670158	MDU6SXNzdWU2ODg2NzAxNTg=	147	SQLITE_MAX_VARS maybe hard-coded too low	simonwiles 96218	open		7	2020-08-30T07:26:45Z	2021-02-15T21:27:55Z	CONTRIBUTOR	I came across this while about to open an issue and PR against the documentation for `batch_size`, which is a bit incomplete. As mentioned in #145, while: `SQLITE_MAX_VARIABLE_NUMBER` ... defaults to 999 for SQLite versions prior to 3.32.0 (2020-05-22) or 32766 for SQLite versions after 3.32.0. it is common that it is increased at compile time. Debian-based systems, for example, seem to ship with a version of sqlite compiled with SQLITE_MAX_VARIABLE_NUMBER set to 250,000, and I believe this is the case for homebrew installations too. In working to understand what `batch_size` was actually doing and why, I realized that by setting `SQLITE_MAX_VARS` in `db.py` to match the value my sqlite was compiled with (I'm on Debian), I was able to decrease the time to `insert_all()` my test data set (~128k records across 7 tables) from ~26.5s to ~3.5s. Given that this about .05% of my total dataset, this is time I am keen to save... Unfortunately, it seems that `sqlite3` in the python standard library doesn't expose the `get_limit()` C API (even though `pysqlite` used to), so it's hard to know what value sqlite has been compiled with (note that this could mean, I suppose, that it's less than 999, and even hardcoding `SQLITE_MAX_VARS` to the conservative default might not be adequate. It can also be lowered -- but not raised -- at runtime). The best I could come up with is `echo "" \| sqlite3 -cmd ".limits variable_number"` (only available in `sqlite >= 2015-05-07 (3.8.10)`). Obviously this couldn't be relied upon in `sqlite_utils`, but I wonder what your opinion would be about exposing `SQLITE_MAX_VARS` as a user-configurable parameter (with suitable "here be dragons" warnings)? I'm going to go ahead and monkey-patch it for my purposes in any event, but it seems like it might be worth considering.	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/147/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
698791218	MDU6SXNzdWU2OTg3OTEyMTg=	50	favorites --stop_after=N stops after min(N, 200)	mikepqr 370930	open		2	2020-09-11T03:38:14Z	2020-09-13T05:11:14Z	CONTRIBUTOR	For any number greater than 200, `favorites --stop_after` stops after getting 200 tweets, e.g. $ twitter-to-sqlite favorites tweets.db --stop_after=300 Importing favorites [####################################] 199 $ I don't think this is a limitation of the API (if you omit `--stop_after` you get some very large number, possibly all of them), so I think this is a bug.	twitter-to-sqlite 206156866	issue	{ "url": "https://api.github.com/repos/dogsheep/twitter-to-sqlite/issues/50/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
751195017	MDU6SXNzdWU3NTExOTUwMTc=	1111	Accessing a database's `.json` is slow for very large SQLite files	asg017 15178711	open		3	2020-11-26T00:27:27Z	2021-01-04T19:57:53Z	CONTRIBUTOR	I have a SQLite DB that's pretty large, 23GB and something like 300 million rows. I expect that most queries I run on it will be slow, which is fine, but there are some things that Datasette does that makes working with the DB very slow. Specifically, when I access the `.json` metadata for a table (which I believe it comes from `datasette/views/database.py`, it takes 43 seconds for the request to come in: bash $ time curl localhost:9999/out.json {"database": "out", "size": 24291454976, "tables": [{"name": "PageviewsHour", "columns": ["file", "code", "page", "pageviews"], "primary_keys": [], "count": null, "hidden": false, "fts_table": null, "foreign_keys": {"incoming": [], "outgoing": [{"other_table": "PageviewsHourFiles", "column": "file", "other_column": "file_id"}]}, "private": false}, {"name": "PageviewsHourFiles", "columns": ["file_id", "filename", "sha256", "size", "day", "hour"], "primary_keys": ["file_id"], "count": null, "hidden": false, "fts_table": null, "foreign_keys": {"incoming": [{"other_table": "PageviewsHour", "column": "file_id", "other_column": "file"}], "outgoing": []}, "private": false}, {"name": "sqlite_sequence", "columns": ["name", "seq"], "primary_keys": [], "count": 1, "hidden": false, "fts_table": null, "foreign_keys": {"incoming": [], "outgoing": []}, "private": false}], "hidden_count": 0, "views": [], "queries": [], "private": false, "allow_execute_sql": true, "query_ms": 43340.23213386536} real 0m43.417s user 0m0.006s sys 0m0.016s I suspect this is because a `COUNT()` is happening under the hood, which, when I run it through sqlite directly, does take around the same time: ```bash $ time sqlite3 out.db < <(echo "select count() from PageviewsHour;") 362794272 real 0m44.523s user 0m2.497s sys 0m6.703s ``` I'm using the `.json` request in the Observable Datasette Client to 1) verify that a link passed in is a reachable Datasette instance, and 2) a quick way to look at metadata for a db. A few different solutions I can think of: Have some other endpoint, like `/-/datasette.json` that the Observable Datasette client can fetch from to verify that the passed in URL is a valid Datasette (doesnt solve the slow problem, feel free to split this issue into 2) Have a way to turn off table counts when accessing a database's `.json` view, like `?no_count=1` or something Maybe have a timeout on the `table_counts()` function if it takes too long. which is odd, because it seems like it already does that (I think?), I can debug a little more if that's the case More than happy to debug further, or send a PR if you like one of the proposals above!	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1111/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
756875827	MDU6SXNzdWU3NTY4NzU4Mjc=	1129	Fix footer to the bottom of the page	abdusco 3243482	open		0	2020-12-04T07:28:07Z	2020-12-04T16:04:29Z	CONTRIBUTOR	Footer doesn't stick to the bottom if the body content isn't long enough to reach the end of viewport. This can be fixed using flexbox. ```css body { min-height: 100vh; display: flex; flex-direction: column; } .content { flex-grow: 1; } ```	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1129/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
779088071	MDU6SXNzdWU3NzkwODgwNzE=	54	Archive import appears to be broken on recent exports	jacobian 21148	open		5	2021-01-05T14:18:01Z	2023-01-04T11:06:55Z	CONTRIBUTOR	I requested a Twitter export yesterday, and unfortunately they seem to have changed it such that `twitter-to-sqlite import` can't handle it anymore 😢 So far I've ran into two issues. The first was easy to work around, but the second will take more investigation. If I can find the time I'll keep working on it and update this issue accordingly. The issues (so far): 1. Data seems to have moved to a `data/` subdirectory Running `twitter-to-sqlite import` on the raw zip file reports a bunch of "not yet implemented" errors, and then exits without actually importing anything: `❯ twitter-to-sqlite import tarchive.db twitter.zip ... data/manifest: not yet implemented data/account-creation-ip: not yet implemented data/account-suspension: not yet implemented ... (dozens of more lines like this, including critical stuff like data/tweets) ...` (`tarchive.db` now exists, but is empty) Workaround: unpack the zip file, and run `twitter-to-sqlite import tarchive.db path/to/archive/data` That gets further, but: 2. Some schema(s?) have changed At least, the `blocks` schema seems different now: ❯ twitter-to-sqlite import tarchive.db archive/data direct-messages-group: not yet implemented branch-links: not yet implemented periscope-expired-broadcasts: not yet implemented direct-messages: not yet implemented mute: not yet implemented Traceback (most recent call last): File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/bin/twitter-to-sqlite", line 8, in <module> sys.exit(cli()) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/click/core.py", line 829, in __call__ return self.main(args, kwargs) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/click/core.py", line 782, in main rv = self.invoke(ctx) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/click/core.py", line 1259, in invoke return _process_result(sub_ctx.command.invoke(sub_ctx)) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/click/core.py", line 1066, in invoke return ctx.invoke(self.callback, ctx.params) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/click/core.py", line 610, in invoke return callback(args, **kwargs) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/twitter_to_sqlite/cli.py", line 772, in import_ archive.import_from_file(db, filepath.name, open(filepath, "rb").read()) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/twitter_to_sqlite/archive.py", line 215, in import_from_file to_insert = transformer(data) File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/twitter_to_sqlite/archive.py", line 115, in lists_member return {"lists-member": _list_from_common(data)} File "/Users/jacob/Library/Caches/pypoetry/virtualenvs/jacobian-dogsheep-4AXaN4tu-py3.8/lib/python3.8/site-packages/twitter_to_sqlite/archive.py", line 200, in _list_from_common for url in block["userListInfo"]["urls"]: KeyError: 'urls' That's as far as I got before I needed to work on something else. I'll report back if I get further!	twitter-to-sqlite 206156866	issue	{ "url": "https://api.github.com/repos/dogsheep/twitter-to-sqlite/issues/54/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
797097140	MDU6SXNzdWU3OTcwOTcxNDA=	60	Use Data from SQLite in other commands	daniel-butler 22578954	open		3	2021-01-29T18:35:52Z	2021-02-12T18:29:43Z	CONTRIBUTOR	As a total beginner here how could you access data from the sqlite table to run other commands. What I am thinking is I want to get all the repos in an organization then using the repo list pull all the commit messages for each repo. I love this project by the way!	github-to-sqlite 207052882	issue	{ "url": "https://api.github.com/repos/dogsheep/github-to-sqlite/issues/60/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
797784080	MDU6SXNzdWU3OTc3ODQwODA=	62	Stargazers and workflows commands always require an auth file when using GITHUB_TOKEN	frosencrantz 631242	open		0	2021-01-31T18:56:05Z	2021-01-31T18:56:05Z	CONTRIBUTOR	Requested fix in https://github.com/dogsheep/github-to-sqlite/pull/59 The stargazers and workflows commands always require an auth file, even when using a `GITHUB_TOKEN`. Other commands don't require the auth file.	github-to-sqlite 207052882	issue	{ "url": "https://api.github.com/repos/dogsheep/github-to-sqlite/issues/62/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
817989436	MDU6SXNzdWU4MTc5ODk0MzY=	242	Async support	eyeseast 25778	open		13	2021-02-27T18:29:38Z	2021-10-28T14:37:56Z	CONTRIBUTOR	Following our conversation last week, want to note this here before I forget. I've had a couple situations where I'd like to do a bunch of updates in an async event loop, but I run into SQLite's issues with concurrent writes. This feels like something sqlite-utils could help with. PeeWee ORM has a SQLite write queue that might be a good model. It's using threads or gevent, but I think that approach would translate well enough to asyncio. Happy to help with this, too.	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/242/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
843884745	MDU6SXNzdWU4NDM4ODQ3NDU=	1283	advanced #export causes unexpected scrolling	mroswell 192568	open		0	2021-03-29T22:46:57Z	2021-03-29T22:46:57Z	CONTRIBUTOR	Visit a datasette table page Click on the "(advanced)" link. This adds a fragment identifier "#export" to the URL, and scrolls down to the "Advanced export" div with the "export" id. Manually scroll back up, and click on a suggested facet. The fragment identifier is still present, and the app scrolls back down to the "Advanced export" div. I think this is unwanted behavior. The user remedy seems to be to manually remove the "#export" from the URL. This behavior happens in my project, and in: https://covid-19.datasettes.com/covid/economist_excess_deaths (for instance) but not in this table: https://global-power-plants.datasettes.com/global-power-plants/global-power-plants	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1283/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
845794436	MDU6SXNzdWU4NDU3OTQ0MzY=	1284	Feature or Documentation Request: Individual table as home page template	mroswell 192568	open		4	2021-03-31T03:56:17Z	2021-11-04T03:15:01Z	CONTRIBUTOR	It would be great to have a sample showing how to move a single database that has a single table, to the index page. I'm trying it now, and find there is a real depth of Datasette and Python understanding that's required to be successful. I've got all the basic jinja concepts down... variables, template control structures, template inheritance, template overrides, css, html, the --template-dir and --static arguments, etc. But copying the table.html file to index.html doesn't work. There are undocumented functions and filters... I can figure some of them out (yay, url_builder.py and utils/init.py!) but it's a slog better handled by a much stronger Python developer. One sample would make a world of difference. The ideal form of this documentation would be a diff between the default table.html and how that would look if essentially moved to index.html. The use case is for everyone who wants to create a public-facing website to explore a single table at the root directory. (Maybe a second bit of documentation for people who have a single database with multiple tables.) (Hmm... might be cool to have a setting for that, where it happens automagically! If only one table, then home page is at the table level. if only one database, then home page is at the database level.... as an option.) I suppose I could ignore this, and somehow do this in the DNS settings once I hook up Vercel to a domain name, maybe.. and remove the breadcrumbs in table.html... but for now, a documentation request in the form of a diff... for viewing a single table (or a single database) at the root. (Actually, there's probably room for a whole expanded section on templates. Noticed some nice table metadata in one of the datasette examples, for instance... Hmm... maybe a whole library of solutions in one place... maybe a documentation hackathon! If that's of interest, of course it's a separate issue. )	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1284/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
847700726	MDU6SXNzdWU4NDc3MDA3MjY=	1285	Feature Request or Plugin Request: Numeric Range Facets	mroswell 192568	open		0	2021-04-01T01:50:20Z	2021-04-01T02:28:19Z	CONTRIBUTOR	It would be great to offer facets for numeric data ranges. The ranges could pull from typical GIS methods of creating choropleth maps. https://gisgeography.com/choropleth-maps-data-classification/ Of the following, for mapping, I've always preferred a Jenks Natural Breaks, or a cross between Jenks and Pretty breaks. Equal Intervals Quantile (equal count) Standard Deviation Natural Breaks (Jenks) Classification Pretty Breaks Some sort of Aggregate Jenks Classification (this isn't standard, but it would be nice to be able to set classification ranges that work across tables.) Here are some links for Natural Breaks, in case this method is unfamiliar. https://en.wikipedia.org/wiki/Jenks_natural_breaks_optimization http://wiki.gis.com/wiki/index.php/Jenks_Natural_Breaks_Classification https://medium.com/analytics-vidhya/jenks-natural-breaks-best-range-finder-algorithm-8d1907192051 Per that last link, there is a Jenks Python module... They also describe it as data-intensive for larger datasets. Maybe this is a good plugin idea. An example of equal Intervals would be 0 – < 10 10 – < 20 20 – < 30 30 – < 40 It's kind of confusing to have that less-than sign in there. it could also be displayed as: 0 – 10 10 – 20 20 – 30 30 – 40 But then it's not completely clear which category 10 is in, for instance. (Best to right-justify.. and use an "en dash" between numbers.)	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1285/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
849220154	MDU6SXNzdWU4NDkyMjAxNTQ=	1286	Better default display of arrays of items	mroswell 192568	open		5	2021-04-02T13:31:40Z	2021-06-12T12:36:15Z	CONTRIBUTOR	Would be great to have template filters that convert array fields to bullets and/or delimited lists upon table display: `\|to_bullets \|to_comma_delimited \|to_semicolon_delimited` or maybe: `\|join_array("bullet") \|join_array("bullet","square") \|join_array(";") \|join_array(",")` Keeping in mind that bullets show up in html as \<li> while other delimiting characters appear after the value. Of course, the fields themselves would remain as facetable arrays.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1286/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
853672224	MDU6SXNzdWU4NTM2NzIyMjQ=	1294	"You can check out any time you like. But you can never leave!"	mroswell 192568	open		0	2021-04-08T17:02:15Z	2021-04-08T18:35:50Z	CONTRIBUTOR	(Feel free to rename this one.) The column gear lets you "Show not-blank rows." Then it places a parameter in the URL, which a web developer would notice, but a lot of users won't notice, or know to delete it. Would be good to toggle "Show not-blank rows" with "Show all rows." (Also would be quite helpful to have a "Show blank rows \| Show all rows" option) The column gear lets you "Sort ascending" and "Sort descending" but then you're stuck with some sort of sorted version thereafter, unless you know to sort the ID column, or to remove the full _sort parameter and its value in the URL. Would be good to offer a "Remove sort" option in the gear. These requests are in the same camp as: https://github.com/simonw/datasette-vega/issues/36 I suspect there are other url parameter instances where similar analysis would be helpful, but the three above are the use cases I've run across. UPDATE: - It would be helpful to have a "Previous page" available for all but the first table page.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1294/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
855451460	MDU6SXNzdWU4NTU0NTE0NjA=	1297	Documentation: json1, and introspection endpoints	mroswell 192568	open		0	2021-04-12T00:38:00Z	2021-04-12T01:29:33Z	CONTRIBUTOR	https://docs.datasette.io/en/stable/facets.html notes that: If your SQLite installation provides the json1 extension (you can check using /-/versions) Datasette will automatically detect columns that contain JSON arrays... When I check -/versions I see two sections relevant to json1: `"extensions": { "json1": null }, "compile_options": [ ... "ENABLE_JSON1",` The ENABLE_JSON1 makes me think json1 is likely available. But the `"json1": null` made me think it wasn't available (because of the `null`). It would help if the documentation provided clarity about how to know if json1 is installed. It would also be helpful if the `/-/versions` information signalled somehow that that is to be appended to the hostname or domain name (or whatever you want to call it, or simply show it, using `example.com/-/versions` instead of `/-/versions`. Likewise on that last point, for https://docs.datasette.io/en/stable/introspection.html#introspection , at least at some point on that page detailing where those introspection endpoints go. (Sometimes documentation can be so abbreviated that it's hard for new users to figure out what's going on.)	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1297/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
855476501	MDU6SXNzdWU4NTU0NzY1MDE=	1298	improve table horizontal scroll experience	mroswell 192568	open		4	2021-04-12T01:55:16Z	2022-08-30T21:11:49Z	CONTRIBUTOR	Wide tables aren't a huge problem if you know to click and drag right. But it's not at all obvious to do that. (it also tends to blue-select any content as it's dragging.) Depending on column widths, public users might entirely miss all the columns to the right. There is a scrollbar at the bottom of the table, but I'm displaying ALL my records because it's the only way for datasette-vega to make accurate charts. So that bottom scrollbar is likely to be missed. I wonder if some sort of javascript-y mouseover to an arrow might help, similar to those seen in image carousels. Ah: here's a perfect example: Visit http://google.com Search for: animals endangered Note the 'g-right-button' (in the code) that looks like a right-facing caret in a circle. Click on that and the carousel scrolls right (and 'g-left-button' appears on the left). Might be tricky to do that on a table, rather than a one-row carousel, but it's worth experimenting with. Another option is just to put the scrollbars at the top of the table, too. Meantime, I'm trying to build a button like the "View/hide all columns on https://salaries.news.baltimoresun.com/salaries-be494cf/2019+Maryland+state+salaries Might be nice to have that available by default, with settings in the metadata showing which are on by default. (I saw some other closed issues related to horizontal scrolling, and admit I don't entirely understand them. For instance, the animated gif at https://github.com/simonw/datasette/issues/998#issuecomment-714117534 confuses me. )	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1298/reactions", "total_count": 4, "+1": 4, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
860722711	MDU6SXNzdWU4NjA3MjI3MTE=	1301	Publishing to cloudrun with immutable mode?	louispotok 5413548	open		1	2021-04-18T17:51:46Z	2022-10-07T02:38:04Z	CONTRIBUTOR	I'm a bit confused about immutable mode and publishing to cloudrun. (I want to publish with immutable mode so that I can support database downloads.) Running `datasette publish cloudrun --extra-options="-i example.db"` leads to an error: Error: Invalid value for '-i' / '--immutable': Path 'example.db' does not exist. However, running `datasette publish cloudrun example.db` not only works but seems to publish in immutable mode anyway! I'm seeing this both with `/-/databases.json` and the fact that downloads are working. When I just `datasette serve` locally, this succeeds both ways and works as expected.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1301/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
860734722	MDU6SXNzdWU4NjA3MzQ3MjI=	1302	Fix disappearing facets	mroswell 192568	open		0	2021-04-18T18:42:33Z	2021-04-20T07:40:15Z	CONTRIBUTOR	Clone https://github.com/mroswell/list-N Run `datasette disinfectants.db -o` Select the `Safer_or_Toxic` facet. Select `Toxic`. Close out the `Safer_or_Toxic` facet. Examine `Suggested facets` list. `Safer_or_Toxic` is GONE. Try some other facets. When you select an element, and then close the list, in some cases, the facet properly returns to the `Suggested facet` list... Arrays and dates properly return to the list, but fields with strings don't return to the list. Since my site is devoted to whether disinfectants are Safer or Toxic, having the suggested facet disappear from the suggested facet list is very confusing* to end-users. This, along with a few other issues, unfortunately proved beyond my own programming ability to address. So I hired a Senior-level developer to address a number of issues, including this disappearing act. Open a new terminal. Run `datasette disinfectants.db -m metadata.json --static static:static/ --template-dir templates/ --plugins-dir plugins/ -p 8001 -o` Repeat steps 3-6, but this time, the Safer_or_Toxic facet returns to the list (and the related URL parameters are removed). I'm not sure how to do a pull request for this, because the plugin contains other functionality that goes beyond this bug. I wanted the facets sorted in a certain order (both in the suggested facet list, and the detail lists) (... the detail lists were hopping around all over the place before...) I wanted the duplicate facets removed (leaving only the one where you can facet by individual item in an array.) I wanted the arrays to be presented in a prettier fashion (I did that in the template... That could be moved over to the plugin at some point) I'm thinking it'll be very helpful if applicable parts of my project's plugin (sort_suggested_facets_plugin.py) will be able to be incorporated back into datasette, but I leave that to you to consider. (* The disappearing facet bug was especially confusing because I'm removing the filters and sql from the table page, at the request of the organization. The filters and sql detail created a lot of confusion for end users who try to find disinfectants used by Hospitals, for instance, as an '=' won't find them, since they are part of the Use_site array.) My disappearing-facet confusion was documented in my own issue: https://github.com/mroswell/list-N/issues/57 (addressed by the plugin). Other facet-related issues here: https://github.com/mroswell/list-N/issues/54 (addressed by the plugin); https://github.com/mroswell/list-N/issues/15 (addressed by template); https://github.com/mroswell/list-N/issues/53 (not yet addressed).	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1302/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
950664971	MDU6SXNzdWU5NTA2NjQ5NzE=	1401	unordered list is not rendering bullet points in description_html on database page	fgregg 536941	open		2	2021-07-22T13:24:18Z	2021-10-23T13:09:10Z	CONTRIBUTOR	Thanks for this tremendous package, @simonw! In the `description_html` for a database, I have an unordered list. However, on the database page on the deployed site, it is not rendering this as a bulleted list. Page here: https://labordata-warehouse.herokuapp.com/nlrb-9da4ae5 The documentation gives an example of using an unordered list in a `description_html`, so I expected this will work.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1401/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
951185411	MDU6SXNzdWU5NTExODU0MTE=	1402	feature request: social meta tags	fgregg 536941	open		2	2021-07-23T01:57:23Z	2021-07-26T19:31:41Z	CONTRIBUTOR	it would be very nice if the twitter, slack, and other social media could make rich cards when people post a link to a datasette instance	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1402/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
959137143	MDU6SXNzdWU5NTkxMzcxNDM=	1415	feature request: document minimum permissions for service account for cloudrun	fgregg 536941	open		4	2021-08-03T13:48:43Z	2023-11-05T16:46:59Z	CONTRIBUTOR	Thanks again for such a powerful project. For deploying to cloudrun from github actions, I'd like to create a service account with minimal permissions. It would be great to document what those minimum permission that need to be set in the IAM.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1415/reactions", "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
959710008	MDU6SXNzdWU5NTk3MTAwMDg=	1419	`publish cloudrun` should deploy a more recent SQLite version	fgregg 536941	open		3	2021-08-04T00:45:55Z	2021-08-05T03:23:24Z	CONTRIBUTOR	I recently changed from deploying a datasette using `datasette publish heroku` to `datasette publish cloudrun`. A query that ran on the heroku site, now throws a syntax error on the cloudrun site. I suspect this is because they are running different versions of sqlite3. Heroku: sqlite3 3.31.1 (-/versions) Cloudrun: sqlite3 3.27.2 (-/versions) If so, it would be great to harmonize the sqlite3 versions across platforms update the docker files so as to update the sqlite3 version for cloudrun	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1419/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
988552851	MDU6SXNzdWU5ODg1NTI4NTE=	1456	conda install results in non-functioning `datasette serve` due to out-of-date asgiref	ctb 51016	open		0	2021-09-05T16:59:55Z	2021-09-05T16:59:55Z	CONTRIBUTOR	Over in https://github.com/ctb/2021-sourmash-datasette, I discovered that the following commands fail: `conda create -n datasette4 -y datasette=0.58.1 conda activate datasette4 datasette gathertax.db` with `ImportError: cannot import name 'WebSocketScope' from 'asgiref.typing'`. This appears to be because asgiref 3.3.4 doesn't have WebSocketScope, but later versions do - a simple `pip install asgiref==3.4.1` fixes the problem for me, at least to the point where I can run datasette and poke around as usual. I note that over in the conda-forge recipe, https://github.com/conda-forge/datasette-feedstock/blob/master/recipe/meta.yaml pins asgiref to < 3.4.0, but I'm not sure why - so I'm not sure how to best resolve this issue :).	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1456/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
988556488	MDU6SXNzdWU5ODg1NTY0ODg=	1459	suggestion: allow `datasette --open` to take a relative URL	ctb 51016	open		1	2021-09-05T17:17:07Z	2021-09-05T19:59:15Z	CONTRIBUTOR	(soft suggestion because I'm not sure I'm using datasette right yet) Over at https://github.com/ctb/2021-sourmash-datasette, I'm playing around with datasette, and I'm creating some static pages to send people to the right facets. There may well be better ways of achieving this end goal, and I will find out if so, I'm sure! But regardless I think it might be neat to support an option to allow `-o/--open` to take a relative URL, that then gets appended to the hostname and port. This would let me improve my documentation. I don't see any downsides, either, but 🤷 there may well be some :) Happy to dig in and provide a PR if it's of interest. I'm not sure off the top of my head how to support an optional value to a parameter in argparse - the current `-o` behavior is kinda nice so it'd be suboptimal to require a url for `-o`. Maybe `--open-url=` or something would work?	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1459/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
989109888	MDU6SXNzdWU5ODkxMDk4ODg=	1460	Override column metadata with metadata from another column	MichaelTiemannOSC 72577720	open		0	2021-09-06T12:13:33Z	2021-09-06T12:13:33Z	CONTRIBUTOR	I have a table from the PUDL project (https://github.com/catalyst-cooperative/pudl) that looks like this: CREATE TABLE fuel_ferc1 ( id INTEGER NOT NULL, record_id TEXT, utility_id_ferc1 INTEGER, report_year INTEGER, plant_name_ferc1 TEXT, fuel_type_code_pudl VARCHAR(7), fuel_unit VARCHAR(7), fuel_qty_burned FLOAT, fuel_mmbtu_per_unit FLOAT, fuel_cost_per_unit_burned FLOAT, fuel_cost_per_unit_delivered FLOAT, fuel_cost_per_mmbtu FLOAT, PRIMARY KEY (id), FOREIGN KEY(plant_name_ferc1, utility_id_ferc1) REFERENCES plants_ferc1 (plant_name_ferc1, utility_id_ferc1), CONSTRAINT fuel_ferc1_fuel_type_code_pudl_enum CHECK (fuel_type_code_pudl IN ('coal', 'oil', 'gas', 'solar', 'wind', 'hydro', 'nuclear', 'waste', 'unknown')), CONSTRAINT fuel_ferc1_fuel_unit_enum CHECK (fuel_unit IN ('ton', 'mcf', 'bbl', 'gal', 'kgal', 'gramsU', 'kgU', 'klbs', 'btu', 'mmbtu', 'mwdth', 'mwhth', 'unknown')) ); Note that `fuel_unit` is a unit that pint can understand, and that `fuel_qty_burned` is a column of data that could be expressed in terms of actual units, not merely as a dimensionless number. Ditto the `fuel_cost_per_unit_...` columns. Is there a way to give a column a default metadata unit (such as tons or USD/ton) and then let that be overridden when the metadata in another column says barrels or USD/gramsU? @catalyst-cooperative	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1460/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
991191951	MDU6SXNzdWU5OTExOTE5NTE=	1464	clean checkout & clean environment has test failures	ctb 51016	open		6	2021-09-08T14:16:23Z	2021-09-13T22:17:17Z	CONTRIBUTOR	I followed the instructions here, and even after running `python update-docs-help.py` I get the following failed tests -- any thoughts? `FAILED tests/test_api.py::test_searchable[/fixtures/searchable.json?_search=te+AND+do&_searchmode=raw-expected_rows3] FAILED tests/test_api.py::test_searchmode[table_metadata1-_search=te+AND+do-expected_rows1] FAILED tests/test_api.py::test_searchmode[table_metadata2-_search=te+AND+do&_searchmode=raw-expected_rows2]` This is with python 3.9.7 and lots of other packages, as in attached environment listing from `conda list`. conda-installed.txt	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1464/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
999902754	I_kwDOBm6k_c47mU4i	1473	base logo link visits `undefined` rather than href url	mroswell 192568	open		2	2021-09-18T04:17:04Z	2021-09-19T00:45:32Z	CONTRIBUTOR	I have two connected sites: http://www.SaferOrToxic.org (a Hugo website) and: http://disinfectants.SaferOrToxic.org/disinfectants/listN (a datasette table page) The latter is linked as "The List" in the former's menu. (I'd love a prettier URL, but that's what I've got.) On: http://disinfectants.SaferOrToxic.org/disinfectants/listN ... all the other menu links should point back to: https://www.SaferOrToxic.org And they do! But the logo, for some reason--though it has an href pointing to: https://www.SaferOrToxic.org Keeps going to this instead: https://disinfectants.saferortoxic.org/disinfectants/undefined What is causing that? How can I fix it? In #1284 back in March, I was doing battle with the index.html template, in a still unresolved issue. (I wanted only a single table page at the root.) But I thought, well, if I can't resolve that, at least I could just point the main website to the datasette page ("The List,") and then have the List point back to the home website. The menu hrefs to https://www.SaferOrToxic.org work just fine, exactly as they should, from the datasette page. Even the Home link works properly. But the logo link keeps rewriting to: https://disinfectants.saferortoxic.org/disinfectants/undefined This is the HTML: `<a class="text-3xl font-bold leading-none" href="https://www.saferortoxic.org"><img src="https://www.saferortoxic.org/images/logo_hu26e4dce8d5931af1ea33526b28fc8383_9734_c52a4f1635ef88bda858373270551ed2.webp" class="custom-logo" alt="Logo: Safer or Toxic?" width="300px"></a>` Is this somehow related to cloudflare? Or something in the datasette code? I'm starting to think it's a cloudflare issue. Can I at least rule out it being a datasette issue? My repository is here: https://github.com/mroswell/list-N (BTW, I couldn't figure out how to reference a local image, either, on the datasette side, which is why I'm using the image from the www home page.)	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1473/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1006781949	I_kwDOBm6k_c48AkX9	1478	Documentation Request: Feature alternative ID instead of default ID	mroswell 192568	open		0	2021-09-24T19:56:13Z	2021-09-25T16:18:54Z	CONTRIBUTOR	My data already has an ID that comes from a federal agency. Would love to have documentation on how to modify the template to: - Remove the generated ID from the table - Link the federal ID to the detail page - and to ensure that the JSON file uses that as the ID. I'd be happy to include the database ID in the export, but not as a key. I don't want to remove the ID from the database, though, because my experience with the federal agency is that data often has anomalies. I don't want all hell to break loose if they end up applying the same ID to multiple rows (which they haven't done yet). I just don't want it to display in the table or the data exports. Perhaps this isn't a template issue, maybe more of a db manipulation... Margie	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1478/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1015646369	I_kwDOBm6k_c48iYih	1480	Exceeding Cloud Run memory limits when deploying a 4.8G database	ghing 110420	open		9	2021-10-04T21:20:24Z	2022-10-07T04:39:10Z	CONTRIBUTOR	When I try to deploy a 4.8G SQLite database to Google Cloud Run, I get this error message: Memory limit of 8192M exceeded with 8826M used. Consider increasing the memory limit, see https://cloud.google.com/run/docs/configuring/memory-limits Unfortunately, the maximum amount of memory that can be allocated to an instance is 8192M. Naively profiling the memory usage of running Datasette with this database locally on my MacBook shows the following memory usage (using Activity Monitor) when I just start up Datasette locally: Real Memory Size: 70.6 MB Virtual Memory Size: 4.51 GB Shared Memory Size: 2.5 MB Private Memory Size: 57.4 MB I'm trying to understand if there's a query or other operation that gets run during container deployment that causes memory use to be so large and if this can be avoided somehow. This is somewhat related to #1082, but on a different platform, so I decided to open a new issue.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1480/reactions", "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1060631257	I_kwDOBm6k_c4_N_LZ	1528	Add new `"sql_file"` key to Canned Queries in metadata?	asg017 15178711	open		3	2021-11-22T21:58:01Z	2022-06-10T03:23:08Z	CONTRIBUTOR	Currently for canned queries, you have to inline SQL in your `metadata.yaml` like so: `yaml databases: fixtures: queries: neighborhood_search: sql: \|- select neighborhood, facet_cities.name, state from facetable join facet_cities on facetable.city_id = facet_cities.id where neighborhood like '%' \|\| :text \|\| '%' order by neighborhood title: Search neighborhoods` This works fine, but for a few reasons, I usually have my canned queries already written in separate `.sql` files. I'd like to instead re-use those instead of re-writing it. So, I'd like to see a new `"sql_file"` key that works like so: `metadata.yaml`: `yaml databases: fixtures: queries: neighborhood_search: sql_file: neighborhood_search.sql title: Search neighborhoods` `neighborhood_search.sql`: `sql select neighborhood, facet_cities.name, state from facetable join facet_cities on facetable.city_id = facet_cities.id where neighborhood like '%' \|\| :text \|\| '%' order by neighborhood` Both of these would work in the exact same way, where Datasette would instead open + include `neighborhood_search.sql` on startup. A few reasons why I'd like to keep my canned queries SQL separate from metadata.yaml: Keeping SQL in standalone SQL files means syntax highlighting and other text editor integrations in my code Multiline strings in yaml, while functional, are a tad cumbersome and are hard to edit Works well with other tools (can pipe `.sql` files into the `sqlite3` CLI, or use with other SQLite clients easier) Typically my canned queries are quite long compared to everything else in my metadata.yaml, so I'd love to separate it where possible Let me know if this is a feature you'd like to see, I can try to send up a PR if this sounds right!	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1528/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1077620955	I_kwDOBm6k_c5AOzDb	1549	Redesign CSV export to improve usability	fgregg 536941	open	Datasette 1.0 3268330	5	2021-12-11T19:02:12Z	2022-04-04T11:17:13Z	CONTRIBUTOR	Original title: Set content type for CSV so that browsers will attempt to download instead opening in the browser Right now, if the user clicks on the CSV related to a <s>table or a</s> query, the response header for the content type is "content-type: text/plain; charset=utf-8" Most browsers will try to open a file with this content-type in the browser. This is not what most people want to do, and lots of folks don't know that if they want to download the CSV and open it in the a spreadsheet program they next need to save the page through their browser. It would be great if the response header could be something like `'Content-type: text/csv'); 'Content-disposition: attachment;filename=MyVerySpecial.csv');` which would lead browsers to open a download dialog.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1549/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1079111498	I_kwDOBm6k_c5AUe9K	1553	if csv export is truncated in non streaming mode set informative response header	fgregg 536941	open		3	2021-12-13T22:50:44Z	2021-12-16T19:17:28Z	CONTRIBUTOR	streaming mode is currently not enabled for custom queries, so the queries will be truncated to max row limit. it would be great if a response is truncated that an header signalling that was set in the header. i need to write some pagination code for getting full results back for a custom query and it would make the code much better if i could reliably known when there is nothing more to limit/offset	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1553/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1090810196	I_kwDOBm6k_c5BBHFU	1583	consider adding deletion step of cloudbuild artifacts to gcloud publish	fgregg 536941	open		1	2021-12-30T00:33:23Z	2021-12-30T00:34:16Z	CONTRIBUTOR	right now, as part of the the publish process images and other artifacts are stored to gcloud's cloud storage before being deployed to cloudrun. after successfully deploying, it would be nice if the the script deleted these artifacts. otherwise, if you have regularly scheduled build process, you can end up paying to store lots of out of date artifacts.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1583/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1096536240	I_kwDOBm6k_c5BW9Cw	1586	run analyze on all databases as part of start up or publishing	fgregg 536941	open		1	2022-01-07T17:52:34Z	2022-02-02T07:13:37Z	CONTRIBUTOR	Running `analyze;` lets sqlite's query planner make much better use of any indices. It might be nice if the analyze was run as part of the start up of "serve" or "publish".	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1586/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1108671952	I_kwDOBm6k_c5CFP3Q	1605	Scripted exports	eyeseast 25778	open		10	2022-01-19T23:45:55Z	2022-11-30T15:06:38Z	CONTRIBUTOR	Posting this while I'm thinking about it: I mentioned at the end of this thread that I'm usually doing `datasette --get` to export canned queries. I used to use a tool called datafreeze to do scripted exports, but that project looks dead now. The ergonomics of it are pretty nice, though, and the `Freezefile.yml` structure is actually not too far from Datasette's canned queries. This is related to the idea for `datasette query` (#1356) but I think it's a distinct feature. It's most likely a plugin, but I want to raise it here because it's probably something other people have thought about.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1605/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1160034488	I_kwDOCGYnMM5FJLi4	411	Support for generated columns	eyeseast 25778	open		8	2022-03-04T20:41:33Z	2022-03-11T22:32:43Z	CONTRIBUTOR	This is a fairly new feature -- SQLite version 3.31.0 (2020-01-22) -- that I, admittedly, haven't gotten to work yet. But it looks incredibly useful: https://dgl.cx/2020/06/sqlite-json-support I'm not sure if this is an option on `add-column` or a separate command like `add-generated-column`. Either way, it needs an argument to populate it. It could be something like this: `sh sqlite-utils add-column data.db table-name generated --as 'json_extract(data, "$.field")' --virtual` More here: https://www.sqlite.org/gencol.html	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/411/reactions", "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1163369515	I_kwDOBm6k_c5FV5wr	1655	query result page is using 400mb of browser memory 40x size of html page and 400x size of csv data	fgregg 536941	open		8	2022-03-09T00:56:40Z	2023-10-17T21:53:17Z	CONTRIBUTOR	this page is using about 400 mb in firefox 97 on mac os x. if you download the html for the page, it's about 11mb and if you get the csv for the data its about 1mb. it's using over a 1G on chrome 99. i found this because, i was trying to figure out why editing the SQL was getting very slow.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1655/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1182227211	I_kwDOBm6k_c5Gd1sL	1692	[plugins][feature request]: Support additional script tag attributes when loading custom JS	hydrosquall 9020979	open		2	2022-03-27T01:16:03Z	2022-03-30T06:14:51Z	CONTRIBUTOR	Motivation The build system for my new plugin has two output JS files, one for browsers that support ES modules, one for browsers that don't. At present, I'm only passing one of them into Datasette. I'd like to specify the non-es-module script as a fallback for older browsers. I don't want to load it by default, because browsers will only need one, and it's heavy, so for now I'm only supporting modern browsers. To be able to support legacy browsers without slowing down users with modern browsers, I would like to be able to set additional HTML attributes on the tag fallback script, `nomodule` and `defer`. My injected scripts should look something like this: ```html <script type="module" src="/index.my-es-module-bundle.js"></script> <script src="/index.my-legacy-fallback-bundle.js" nomodule="" defer></script> ``` Proposal To achieve this, I propose additional optional properties to the API accepted by the `extra_js_urls` hook and custom JS field the `metadata.json` described here. Under this API, I'd write something like this to get the above HTML rendered in Datasette. `json { "extra_js_urls": [ { "url": "/index.my-es-module-bundle.js", "module": true, }, { "url": "/index.my-legacy-fallback-bundle.js", "nomodule": "", "defer": true } ] }` Resources MDN on the script tag There may be other properties that could be added that are potentially valuable, like `async` or `referrerpolicy`, but I don't have an immediate need for those.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1692/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1193090967	I_kwDOBm6k_c5HHR-X	1699	Proposal: datasette query	eyeseast 25778	open		6	2022-04-05T12:36:43Z	2022-04-11T01:32:12Z	CONTRIBUTOR	I started sketching out a plugin to add a `datasette query` subcommand to export data from the command line. This is based on discussions in #1356 and #1605. Before I get too far down this rabbit hole, I figure it's worth getting some feedback here (unless this should happen in `Discussions`). Here's what I'm thinking: At its most basic, it will write the results of a query to STDOUT. `sh datasette query -d data.db 'select * from data' > results.json` This isn't much improvement over using sqlite-utils. To make better use of datasette and its ecosystem, run `datasette query` using a canned query defined in a `metadata.yml` file. For example, using the metadata file from alltheplaces-datasette: `sh cd alltheplaces-datasette datasette query -d alltheplaces.db -m metadata.yml count_by_spider` That query would be good to get as CSV, and we can auto-discover metadata and databases in the current directory: `sh cd alltheplaces-datasette datasette query count_by_spider -f csv` In this case, `count_by_spider` is a canned query defined on the `alltheplaces` database. If the same query is defined on multiple databases or its otherwise unclear which database `query` should use, pass the `-d` or `--database` option. If a query takes parameters, I can pass them in at runtime, using the `--param` or `-p` option: `sh datasette query -d data.db -p value something 'select * from neighborhoods where some_column = :value'` I'm very interested in feedback on this, including whether it should be a plugin or in Datasette core. (I don't have a strong opinion about this, but I'm prototyping it as a plugin to start.)	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1699/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1198822563	I_kwDOBm6k_c5HdJSj	1706	[feature] immutable mode for a directory, not just individual sqlite file	hydrosquall 9020979	open		4	2022-04-10T00:50:57Z	2022-12-09T19:11:40Z	CONTRIBUTOR	Motivation I have a directory of sqlite databases I'd like to use immutable mode when opening them for better performance docs Currently using this flag throws the following error IsADirectoryError: [Errno 21] Is a directory: '/name-of-directory' Proposal Immutable flag works for both single files and directories `datasette -i /folder-of-sqlite-files`	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1706/reactions", "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1200224939	I_kwDOBm6k_c5Hifqr	1707	[feature] expanded detail page	fgregg 536941	open		1	2022-04-11T16:29:17Z	2022-04-11T16:33:00Z	CONTRIBUTOR	Right now, if click on the detail page for a row you get the info for the row and links to related tables: It would be very cool if there was an option to expand the rows of the related tables from within this detail view. If you had that then datasette could fulfill a pretty common use case where you want to search for an entity and get a consolidate detail view about what you know about that entity.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1707/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1310243385	I_kwDOCGYnMM5OGLo5	456	feature request: pivot command	fgregg 536941	open		5	2022-07-20T00:58:08Z	2022-07-20T17:50:50Z	CONTRIBUTOR	pivoting long-format table to wide-format tables is pretty common and kind of pain. would love to see this feature in sqlite-utils!	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/456/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1324659241	I_kwDOCGYnMM5O9LIp	459	Single quoted transform recipes on Windows do not work as expected	shakeel 19921	open		0	2022-08-01T16:14:54Z	2022-08-01T16:14:54Z	CONTRIBUTOR	Trying to follow the tutorial for sqlite-utils and datasette https://datasette.io/tutorials/clean-data on Windows 11 OS `Microsoft Windows [Version 10.0.22622.440]`, with sqlite-utils and datasette installed using pipx. `pipx list package datasette 0.61.1, installed using Python 3.10.4 - datasette.exe package sqlite-utils 3.28, installed using Python 3.10.4 - sqlite-utils.exe` In the step to transform dates into ISO dates the quoted value `'r.parsedatetime(value)'` is copied verbatim into the columns instead of applying the output of the Python recipe. ``` sqlite-utils convert manatees.db locations \ REPDATE created_date last_edited_date \ 'r.parsedatetime(value)' --dry-run 1975/01/31 00:00:00+00 --- becomes: r.parsedatetime(value) Would affect 13568 rows ``` However, if I change the code from single quotes to double quotes, it works as expected. ``` sqlite-utils convert manatees.db locations \ REPDATE created_date last_edited_date \ "r.parsedatetime(value)" --dry-run 1975/01/31 00:00:00+00 --- becomes: 1975-01-31T00:00:00+00:00 Would affect 13568 rows ``` Specifying the transform code recipe should work with single quotes on Windows.	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/459/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1355193529	I_kwDOCGYnMM5Qxpy5	479	OperationalError: cannot VACUUM from within a transaction	chapmanjacobd 7908073	open		0	2022-08-30T05:34:24Z	2022-08-30T05:34:24Z	CONTRIBUTOR	Maybe when calling `.vacuum()` and other DB-level write-lock operations `sqlite_utils` could guard against this error message by automatically committing first? ``` 46 db["media"].optimize() # type: ignore ---> 47 db.vacuum() File ~/.local/lib/python3.10/site-packages/sqlite_utils/db.py:1047, in Database.vacuum(self) 1045 def vacuum(self): 1046 "Run a SQLite `VACUUM` against the database." -> 1047 self.execute("VACUUM;") File ~/.local/lib/python3.10/site-packages/sqlite_utils/db.py:470, in Database.execute(self, sql, parameters) 468 return self.conn.execute(sql, parameters) 469 else: --> 470 return self.conn.execute(sql) OperationalError: cannot VACUUM from within a transaction ``` It might also be nice to add a sentence or two about how transactions are committed on the docs page. When I was swapping out my sqlite3 code for this library it was nice that everything was pretty much drop-in but I was/am unsure what to do about the places I explicitly call `.commit()` in my code Related to https://github.com/simonw/sqlite-utils/issues/121	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/479/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1393202060	I_kwDOCGYnMM5TCpOM	496	devrel/python api: Pylance type hinting	chapmanjacobd 7908073	open		4	2022-10-01T03:03:34Z	2023-05-03T05:53:27Z	CONTRIBUTOR	Pylance is generally pretty good at figuring out stuff but `sqlite-utils` has some quirks which make type hinting kinda useless. Maybe you don't care but I thought I would bring it to your attention. For example: `db["subs"].insert_all(subs, pk="index")` `Cannot access member "insert_all" for type "View" Member "insert_all" is unknown` `insert_all` and all the other methods show up as a type issues because the program can't know whether something is a View or a Table. Fair enough. But that basically throws all type checking out the window. `pk="index"` also shows up as a type issue: `Argument of type "Literal['index']" cannot be assigned to parameter "pk" of type "Default" in function "insert_all" "Literal['index']" is incompatible with "Default"` I think this is because DEFAULT is an empty class? maybe a few small changes could be made to make the library more type-friendly The interim solution is of course to turn off type hints completely for the line `db["subs"].insert_all(subs, pk="index") # type: ignore`	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/496/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1400374908	I_kwDOBm6k_c5TeAZ8	1836	docker image is duplicating db files somehow	fgregg 536941	open		13	2022-10-06T22:35:54Z	2022-10-08T16:56:51Z	CONTRIBUTOR	if you look into the docker image created by docker publish, the `datasette inspect` line is duplicating the db files. here's the result of the inspect command:	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1836/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1410548368	I_kwDODFdgUs5UE0KQ	77	Feature: Support GitHub discussions	frosencrantz 631242	open		0	2022-10-16T16:53:38Z	2022-10-16T16:53:38Z	CONTRIBUTOR	Hi @simonw I've been a happy user of this tool. Thank you for writing it and sharing it. I wanted to suggest a feature request to support Discussions. For example the VisiData project has discussions https://github.com/saulpw/visidata/discussions , and it would be useful if there was a way to pull that data into the database. However, I'm not offering a pull request.	github-to-sqlite 207052882	issue	{ "url": "https://api.github.com/repos/dogsheep/github-to-sqlite/issues/77/reactions", "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1439009231	I_kwDOBm6k_c5VxYnP	1884	Exclude virtual tables from datasette inspect	eyeseast 25778	open		6	2022-11-07T21:26:01Z	2022-11-21T04:40:56Z	CONTRIBUTOR	Ran `inspect` on a spatialite database and got these warnings: `ERROR: conn=<sqlite3.Connection object at 0x119e46110>, sql = 'select count() from [SpatialIndex]', params = None: no such module: VirtualSpatialIndex ERROR: conn=<sqlite3.Connection object at 0x119e46110>, sql = 'select count() from [ElementaryGeometries]', params = None: no such module: VirtualElementary ERROR: conn=<sqlite3.Connection object at 0x119e46110>, sql = 'select count(*) from [KNN]', params = None: no such module: VirtualKNN` It still worked, but probably want to catch this.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1884/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1469796454	I_kwDOBm6k_c5Xm1Bm	1920	Document Datasette.metadata() method	eyeseast 25778	open		0	2022-11-30T15:10:36Z	2022-11-30T15:10:36Z	CONTRIBUTOR	Code is here: https://github.com/simonw/datasette/blob/main/datasette/app.py#L503 This will be the official way to access metadata from plugins.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1920/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1469821027	I_kwDOBm6k_c5Xm7Bj	1921	Document methods to get canned queries	eyeseast 25778	open		0	2022-11-30T15:26:33Z	2022-11-30T23:34:21Z	CONTRIBUTOR	Two methods will get canned queries for a Datasette instance: `Datasette.get_canned_queries` will return all canned queries for a database that an `actor` can see. `Datasette.get_canned_query` will return a single canned query by name.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1921/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1509783085	I_kwDOBm6k_c5Z_XYt	1969	sql-formatter javascript is not now working with CloudFlare rocketloader	fgregg 536941	open		0	2022-12-23T21:14:06Z	2023-01-10T01:56:33Z	CONTRIBUTOR	This is probably not a bug with datasette, but I thought you might want to know, @simonw. I noticed today that my CloudFlare proxied datasette instance lost the "Format SQL" option. I'm pretty sure it was there last week. In the CloudFlare settings, if I turn off Rocket Loader, I get the "Format SQL" option back. Rocket Loader works by asynchronously loading the javascript, so maybe there was a recent change that doesn't play well with the asynch loading? I'm up to date with https://github.com/simonw/datasette/commit/e03aed00026cc2e59c09ca41f69a247e1a85cc89	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1969/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1515815014	I_kwDOBm6k_c5aWYBm	1973	render_cell plugin hook's row object is not a sqlite.Row	cldellow 193185	open		4	2023-01-01T20:27:46Z	2023-01-29T00:40:31Z	CONTRIBUTOR	From https://docs.datasette.io/en/stable/plugin_hooks.html#render-cell-row-value-column-table-database-datasette: row - sqlite.Row The SQLite row object that the value being rendered is part of This appears to actually be a CustomRow, but I think that's unrelated to my issue. I have a table: `sql CREATE TABLE IF NOT EXISTS "dss_job_stats"( job_id integer not null references dss_job(id) on delete cascade, host text not null, // other columns elided as irrelevant primary key (job_id, host) );` On datasette 0.63.2, the `render_cell` hook receives a `row` value that looks like: `CustomRow([('job_id', {'value': 2, 'label': '2'}), ('host', 'cldellow.com')])` I expected the `job_id` value to be `2`, but it's actually `{'value': 2, 'label': '2'}`. I can work around this, but was wondering if this was intended behaviour?	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/1973/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1552368054	I_kwDOBm6k_c5ch0G2	2000	rewrite_sql hook	cldellow 193185	open		1	2023-01-23T01:02:52Z	2023-01-23T06:08:01Z	CONTRIBUTOR	I'm not sold that this is a good idea, but thought it'd be worth writing up a ticket. Proposal: add a hook like `python def rewrite_sql(datasette, database, request, fn, sql, params)` It would be called from Database.execute, Database.execute_write, Database.execute_write_script, Database.execute_write_many before running the user's SQL. `fn` would indicate which method was being used, in case that's relevant for the SQL inspection -- for example `execute` only permits a single statement. The hook could return a SQL statement to be executed instead, or an async function to be awaited on that returned the SQL to be executed. Plugins that could be written with this hook: https://github.com/cldellow/datasette-ersatz-table-valued-functions would use this to avoid monkey-patching a plugin to inspect and reject unsafe Spatialite function calls (reported by Simon in Discord) a plugin to do more general rewrites of queries to enforce table or row-level security, for example, based on the currently logged in actor's ID a plugin to maintain audit tables when users write to a table a plugin to cache expensive queries (eg the queries that drive facets) - these could allow stale reads if previously cached, then refresh them in an offline queue Flaws with this idea: `execute_fn` and `execute_write_fn` would not go through this hook, which limits the guarantees you can make about it for security purposes.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2000/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1560651350	I_kwDOCGYnMM5dBaZW	523	Feature request: trim all leading and trailing white space for all columns for all tables in a database	fgregg 536941	open		1	2023-01-28T02:40:10Z	2023-01-28T02:41:14Z	CONTRIBUTOR	It's pretty common that i need to trim leading or trailing white space from lots of columns in a database a part of an initial ETL. I use the following recipe a lot, and it would be great to include this functionality into sqlite-utils `trimify.sql` `sql select 'select group_concat(''update [' \|\| name \|\| '] set ['' \|\| name \|\| ''] = trim(['' \|\| name \|\| ''])'', ''; '') \|\| ''; '' as sql_to_run from pragma_table_info('''\|\|name\|\|''');' from sqlite_schema;` then something like: `bash sqlite3 example.db < scripts/trimify.sql > table_trim.sql && \ sqlite3 $example.db < table_trim.sql > trim.sql && \ sqlite3 $example.db < trim.sql`	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/523/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1565179870	I_kwDOBm6k_c5dSr_e	2013	Datasette uses non-standard quoting for identifiers	cldellow 193185	open		0	2023-02-01T00:05:39Z	2023-02-01T00:06:30Z	CONTRIBUTOR	Related to #2001, but where #2001 was about literals, this is about identifiers From https://www.sqlite.org/lang_keywords.html: "keyword" A keyword in double-quotes is an identifier. [keyword] A keyword enclosed in square brackets is an identifier. This is not standard SQL. This quoting mechanism is used by MS Access and SQL Server and is included in SQLite for compatibility. Datasette uses this quoting here -- https://github.com/simonw/datasette/blob/0b4a28691468b5c758df74fa1d72a823813c96bf/datasette/utils/init.py#L345-L349, in some of the other DB access code, and in some of the test fixtures. Migrating to standard double quote identifiers would make it easier to get Datasette working with alternative backends	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2013/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1571711808	I_kwDOBm6k_c5drmtA	2018	`check_visibility` gives confusing (wrong?) results if permission is `None`	cldellow 193185	open		0	2023-02-06T01:03:08Z	2023-02-06T01:03:46Z	CONTRIBUTOR	I'm trying to gate access to an edit UI on the user having `update-row` on the underlying view or table. I expected datasette.check_visibility to be a good way to do this: ```python visible, private = await datasette.check_visibility( request.actor, permissions=[ ("update-row", (database, table)), ], ) `if not visible: return None` ``` But `visible` is returning true, even when there is no explicit `update-row` permission. (In this case, `request.actor` is `None`.) Based on the update-row permissions docs, I expected this to be default deny, and so no explicit permission would result in false. I think the root cause is that `check_visibility` calls `ensure_permissions` and expects it to throw if the permission is not available. But `ensure_permissions` does not throw when `permission_allowed` returns None: https://github.com/simonw/datasette/blob/1.0a2/datasette/app.py#L825-L829	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2018/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1595340692	I_kwDOCGYnMM5fFveU	530	add ability to configure "on delete" and "on update" attributes of foreign keys:	fgregg 536941	open		2	2023-02-22T15:44:14Z	2023-05-08T20:39:01Z	CONTRIBUTOR	sqlite supports these, and it would be quite nice to be able to add them with sqlite-utils. https://www.sqlite.org/foreignkeys.html#fk_actions	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/530/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1605959201	I_kwDOBm6k_c5fuP4h	2032	datasette errors when foreign key integrity is enabled	cldellow 193185	open		0	2023-03-02T01:27:51Z	2023-03-02T01:31:58Z	CONTRIBUTOR	By default, SQLite does not enforce foreign key constraints. I typically enable these checks by running: `sql PRAGMA foreign_keys = ON;` inside of a `prepare_connection` hook. If a plugin causes the schema to change (eg datasette-scraper creating a new table, or datasette-edit-schema changing a column), then https://github.com/simonw/datasette/blob/0b4a28691468b5c758df74fa1d72a823813c96bf/datasette/utils/internal_db.py#L71-L77 will fail with: `FOREIGN KEY constraint failed` This could be resolved by either: - deleting from the `tables` column last - changing the schema so that the foreign keys have ON DELETE CASCADE Let me know if you'd be open to a PR that addresses this -- since foreign key constraints aren't enabled by default, I guess it's questionable whether this is a bug. I think I can workaround this by inspecting the database parameter in `prepare_connection` and trying not to enable fkey checks on the `_internal` database.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2032/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1620515757	I_kwDOBm6k_c5glxut	2039	Subtle bug with `--load-extension` and `--static` flags with absolute Windows paths with`C:\`	asg017 15178711	open		0	2023-03-12T21:18:52Z	2023-03-12T21:18:52Z	CONTRIBUTOR	From the Datasette discord: A user tried running the following command on windows: `datasette --load-extension="C:\spatialite\mod_spatialite-5.0.1-win-x86\mod_spatialite.dll"` This failed with `"The specified module could not be found"`, because the entrypoint option introduced in #1789 splits the input differently. Instead of loading the extension found at `"C:\spatialite\mod_spatialite-5.0.1-win-x86\mod_spatialite.dll"`, it instead tried to load the extension at `"C"` with entrypoint `"\spatialite\mod_spatialite-5.0.1-win-x86\mod_spatialite.dll". This is hard because most absolute windows paths have a colon in them, like `C:\foo.txt` or `D:\bar.txt`. I'd image the `--static` flag is also vulnerable to this type of bug. The "solution" is to use a relative path instead, but that doesn't feel that great.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2039/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1740026046	I_kwDOCGYnMM5ntrC-	556	Support storing incrementally piped values	mcint 601708	open		1	2023-06-04T00:45:23Z	2023-06-04T01:21:15Z	CONTRIBUTOR	I'm trying to use sqlite-utils to data generated incrementally. There are a few aspects of this that I don't currently know how to handle. I would like an option to apply writes incrementally, line-by-line as they are received. I would like an option to echo incremental progress. And, it would be nice to have In particular, I'm using CoreLocationCLI -w -j to generate, newline-delimited JSON. One variant of the command `stdbuf -oL CoreLocationCLI -w -j \| pee 'sqlite-utils insert loc.db loc -' nl` `pee`, from `moreutils`, is like `tee` but spawns and pipes to the processes created by invoking each of its arguments, so, for gratuitous demonstration, `pee 'sponge out.log' cat` would behave like `tee`. It looks like I can get what I want with: `stdbuf -oL CoreLocationCLI -w -j \| while read line; do <<<"$line" sqlite-utils insert loc.db loc -; echo "$line"; done \| nl`	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/556/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1781530343	I_kwDOBm6k_c5qL_7n	2093	Proposal: Combine settings, metadata, static, etc. into a single `datasette.yaml` File	asg017 15178711	open		8	2023-06-29T21:18:23Z	2023-09-11T20:19:32Z	CONTRIBUTOR	Very often I get tripped up when trying to configure my Datasette instances. For example: if I want to change the port my app listen too, do I do that with a CLI flag, a `--setting` flag, inside `metadata.json`, or an env var? If I want to up the time limit of SQL statements, is that under `metadata.json` or a setting? Where does my plugin configuration go? Normally I need to look it up in Datasette docs, and I quickly find my answer, but the number of places where "config" goes it overwhelming. Flat CLI flags like `--port`, `--host`, `--cors`, etc. `--setting`, like `default_page_size`, `sql_time_limit_ms` etc Inside `metadata.json`, including plugin configuration Typically my Datasette deploys are extremely long shell commands, with multiple `--setting` and other CLI flags. Proposal: Consolidate all "config" into `datasette.toml` I propose that we add a new `datasette.toml` that combines "settings", "metadata", and other common CLI flags like `--port` and `--cors` into a single file. It would be similar to "Cargo.toml" in Rust projects, "package.json" in Node projects, and "pyproject.toml" in Python, etc. A sample of what it could look like: ```toml "top level" configuration that are currently CLI flags on `datasette serve` [config] port = 8020 host = "0.0.0.0" cors = true replaces multiple `--setting` flags [settings] base_url = "/app/datasette/" default_allow_sql = true sql_time_limit_ms = 3500 replaces `metadata.json`. The contents of datasette-metadata.json could be defined in this file instead, but supporting separate files is nice (since those are easy to machine-generate) [metadata] include="./datasette-metadata.json" plugin-specific [plugins] [plugins.datasette-auth-github] client_id = {env = "DATASETTE_AUTH_GITHUB_CLIENT_ID"} client_secret = {env = "GITHUB_CLIENT_SECRET"} [plugins.datasette-cluster-map] latitude_column = "lat" longitude_column = "lon" ``` Pros Instead of multiple files and CLI flags, everything could be in one tidy file Editing config in a separate file is easier than editing CLI flags, since you don't have to kill a process + edit a command every time New users will know "just edit my `datasette.toml` instead of needing to learn metadata + settings + CLI flags Better dev experience for multiple environment. For example, could have `datasette -c datasette-dev.toml` for local dev environments (enables SQL, debug plugins, long timeouts, etc.), and a `datasette -c datasette-prod.toml` for "production" (lower timeouts, less plugins, monitoring plugins, etc.) Cons Yet another config-management system. Now Datasette users will need to know about metadata, settings, CLI flags, and `datasette.toml`. However with enough documentation + announcements + examples, I think we can get ahead of it. If toml is chosen, would need to add a toml parser for Python version <3.11 Multiple sources of config require priority. For example: Would `--setting default_allow_sql off` override the value inside `[settings]`? What about `--port`? Other Notes Toml I chose toml over json because toml supports comments. I chose toml over yaml because Python 3.11 has builtin support for it. I also find toml easier to work with since it doesn't have the odd "gotchas" that YAML has ("ex `3.10` resolving to `3.1`, Norway `NO` resolving to `false`, etc.). It also mimics `pyproject.toml` which is nice. Happy to change my mind about this however Plugin config will be difficult Plugin config is currently in `metadata.json` in two places: Top level, under `"plugins.[plugin-name]"`. This fits well into `datasette.toml` as `[plugins.plugin-name]` Table level, under `"databases.[db-name].tables.[table-name].plugins.[plugin-name]`. This doesn't fit that well into `datasette.toml`, unless it's nested under `[metadata]`? Extensions, static, one-off plugins? We could also include equivalents of `--plugins-dir`, `--static`, and `--load-extension` into `datasette.toml`, but I'd imagine there's a few security concerns there to think through. Explicitly list with plugins to use? I believe Datasette by default will load all install plugins on startup, but maybe `datasette.toml` can specify a list of plugins to use? For example, a dev version of `datasette.toml` can specify `datasette-pretty-traces`, but the prod version can leave it out	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2093/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1783304750	I_kwDOBm6k_c5qSxIu	2094	JS Plugin Hooks for the Code Editor	asg017 15178711	open		0	2023-07-01T00:51:57Z	2023-07-01T00:51:57Z	CONTRIBUTOR	When #2052 merges, I'd like to add support to add extensions/functions to the Datasette code editor. I'd eventually like to build a JS plugin for `sqlite-docs`, to add things like: Inline documentation for tables/columns on hover Inline docs for custom functions that are loaded in More detailed autocomplete for tables/columns/functions I did some hacking to see what this would look like, see here: There can be a new hook that allows JS plugins to add new "extension" in the CodeMirror editorview here: https://github.com/simonw/datasette/blob/8cd60fd1d899952f1153460469b3175465f33f80/datasette/static/cm-editor-6.0.1.js#L25 Will need some more planning. For example, the Codemirror bundle in Datasette has functions that we could re-export for plugins to use (so we don't load 2 version of `"@codemirror/autocomplete"`, for example.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2094/reactions", "total_count": 1, "+1": 0, "-1": 0, "laugh": 0, "hooray": 1, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1821108702	I_kwDOCGYnMM5si-ne	579	Special handling for SQLite column of type `JSON`	asg017 15178711	open		0	2023-07-25T20:37:23Z	2023-07-25T20:37:23Z	CONTRIBUTOR	`sqlite-utils` should detect and have specially handling for column with a `JSON` column. For example: `sql CREATE TABLE "dogs" ( id INTEGER PRIMARY KEY, name TEXT, friends JSON );` Automatic Nesting According to "Nested JSON Values", sqlite-utils will only expand JSON if the `--json-cols` flag is passed. It looks like it'll try to `json.load` all text column to test if its JSON, which can get expensive on non-json columns. Instead, `sqlite-utils` should be default (ie without the `--json-cols` flags) do the `maybe_json()` operation on columns with a declared `JSON` type. So the above table would expand the `"friends"` column as expected, withoutthe `--json-cols` flag: `bash sqlite-utils dogs.db "select * from dogs" \| python -mjson.tool` `[ { "id": 1, "name": "Cleo", "friends": [ { "name": "Pancakes" }, { "name": "Bailey" } ] } ]` I'm sure there's other ways `sqlite-utils` can specially handle JSON columns, so keeping this open while I think of more	sqlite-utils 140912432	issue	{ "url": "https://api.github.com/repos/simonw/sqlite-utils/issues/579/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1822813627	I_kwDOBm6k_c5spe27	2108	some (many?) SQL syntax errors are not throwing errors with a .csv endpoint	fgregg 536941	open		0	2023-07-26T16:57:45Z	2023-07-26T16:58:07Z	CONTRIBUTOR	here's a CTE query that should always fail with a syntax error: `sql with foo as (nonsense) select * from foo;` when we make this query against the default endpoint, we do indeed get a 400 status code the problem is returned to the user: https://global-power-plants.datasettes.com/global-power-plants?sql=with+foo+as+%28nonsense%29+select++from+foo%3B but, if we use the csv endpoint, we get a 200 status code and no indication of a problem: https://global-power-plants.datasettes.com/global-power-plants.csv?sql=with+foo+as+%28nonsense%29+select++from+foo%3B same with this bad sql `sql select a, from foo;` https://global-power-plants.datasettes.com/global-power-plants?sql=select%0D%0A++a%2C%0D%0Afrom%0D%0A++foo%3B vs https://global-power-plants.datasettes.com/global-power-plants.csv?sql=select%0D%0A++a%2C%0D%0Afrom%0D%0A++foo%3B but, datasette catches this bad sql at both endpoints: `sql slect a from foo;` https://global-power-plants.datasettes.com/global-power-plants?sql=slect%0D%0A++a%0D%0Afrom%0D%0A++foo%3B https://global-power-plants.datasettes.com/global-power-plants.csv?sql=slect%0D%0A++a%0D%0Afrom%0D%0A++foo%3B	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2108/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1855885427	I_kwDOBm6k_c5unpBz	2143	De-tangling Metadata before Datasette 1.0	asg017 15178711	open		24	2023-08-18T00:51:50Z	2023-08-24T18:28:27Z	CONTRIBUTOR	Metadata in Datasette is a really powerful feature, but is a bit difficult to work with. It was initially a way to add "metadata" about your "data" in Datasette instances, like descriptions for databases/tables/columns, titles, source URLs, licenses, etc. But it later became the go-to spot for other Datasette features that have nothing to do with metadata, like permissions/plugins/canned queries. Specifically, I've found the following problems when working with Datasette metadata: Metadata cannot be updated without re-starting the entire Datasette instance. The `metadata.json`/`metadata.yaml` has become a kitchen sink of unrelated (imo) features like plugin config, authentication config, canned queries The Python APIs for defining extra metadata are a bit awkward (the `datasette.metadata()` class, `get_metadata()` hook, etc.) Possible solutions Here's a few ideas of Datasette core changes we can make to address these problems. Re-vamp the Datasette Python metadata APIs The Datasette object has a single `datasette.metadata()` method that's a bit difficult to work with. There's also no Python API for inserted new metadata, so plugins have to rely on the `get_metadata()` hook. The `get_metadata()` hook can also be improved - it doesn't work with async functions yet, so you're quite limited to what you can do. (I'm a bit fuzzy on what to actually do here, but I imagine it'll be very small breaking changes to a few Python methods) Add an optional `datasette_metadata` table Datasette should detect and use metadata stored in a new special table called `datasette_metadata`. This would be a regular table that a user can edit on their own, and would serve as a "live updating" source of metadata, than can be changed while the Datasette instance is running. Not too sure what the schema would look like, but I'd imagine: `sql CREATE TABLE datasette_metadata( level text, target any, key text, value any, primary key (level, target) )` Every row in this table would map to a single metadata "entry". `level` would be one of "datasette", "database", "table", "column", which is the "level" the entry describes. For example, `level="table"` means it is metadata about a specific table, `level="database"` for a specific database, or `level="datasette"` for the entire Datasette instance. `target` would "point" to the specific object the entry metadata is about, and would depend on what `level` is specific. `level="database"`: `target` would be the string name of the database that the metadata entry is about. ex `"fixtures"` `level="table"`: `target` would be a JSON array of two strings. The first element would be the database name, and the second would be the table name. ex `["fixtures", "students"]` `level="column"`: `target` would be a JSON array of 3 strings: The database name, table name, and column name. Ex `["fixtures", "students", "student_id"`] `key` would be the type of metadata entry the row has, similar to the current "keys" that exist in `metadata.json`. Ex `"about_url"`, `"source"`, `"description"`, etc `value` would be the text value of be metadata entry. The literal text value of a description, about_url, column_label, etc A quick sample: level \| target \| key \| value -- \| -- \| -- \| -- datasette \| NULL \| title \| my datasette title... db \| fixtures \| source \| <description of my database source> table \| ["fixtures", "students"] \| label_column \| student_name column \| ["fixtures", "students", "birthdate"] \| description \| <description of the fixtures.students.birthdate column> This `datasette_metadata` would be configured with other tools, and hopefully not manually by end users. Datasette Core could also offer a UI for editing entries in `datasette_metadata`, to update descriptions/columns on the fly. Re-vamp `metadata.json` and move non-metadata config to another place The motivation behind this is that it's awkward that `metadata.json` contains config about things that are not strictly metadata, including: Plugin configuration Authentication/permissions (ex the `allow` key on datasettes/databases/tables Canned queries. might be controversial, but in my mind, canned queries are application-specific code and configuration, and don't describe the data that exists in SQLite databases. I think we should move these outside of `metadata.json` and into a different file. The `datasette.json` idea in #2093 may be a good solution here: plugin/permissions/canned queries can be defined in `datasette.json`, while `metadata.json`/`datasette_metadata` will strictly be about documenting databases/tables/columns.	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2143/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1865869205	I_kwDOBm6k_c5vNueV	2157	Proposal: Make the `_internal` database persistent, customizable, and hidden	asg017 15178711	open		3	2023-08-24T20:54:29Z	2023-08-31T02:45:56Z	CONTRIBUTOR	The current `_internal` database is used by Datasette core to cache info about databases/tables/columns/foreign keys of databases in a Datasette instance. It's a temporary database created at startup, that can only be seen by the root user. See an example `_internal` DB here, after logging in as root. The current `_internal` database has a few rough edges: It's part of `datasette.databases`, so many plugins have to specifically exclude `_internal` from their queries examples here It's only used by Datasette core and can't be used by plugins or 3rd parties It's created from scratch at startup and stored in memory. Why is fine, the performance is great, but persistent storage would be nice. Additionally, it would be really nice if plugins could use this `_internal` database to store their own configuration, secrets, and settings. For example: `datasette-auth-tokens` creates a `_datasette_auth_tokens` table to store auth token metadata. This could be moved into the `_internal` database to avoid writing to the gues database `datasette-socrata` creates a `socrata_imports` table, which also can be in `_internal` `datasette-upload-csvs` creates a `_csv_progress_` table, which can be in `_internal` `datasette-write-ui` wants to have the ability for users to toggle whether a table appears editable, which can be either in `datasette.yaml` or on-the-fly by storing config in `_internal` In general, these are specific features that Datasette plugins would have access to if there was a central internal database they could read/write to: Dynamic configuration. Changing the `datasette.yaml` file works, but can be tedious to restart the server every time. Plugins can define their own configuration table in `_internal`, and could read/write to it to store configuration based on user actions (cell menu click, API access, etc.) Caching. If a plugin or Datasette Core needs to cache some expensive computation, they can store it inside `_internal` (possibly as a temporary table) instead of managing their own caching solution. Audit logs. If a plugin performs some sensitive operations, they can log usage info to `_internal` for others to audit later. Long running process status. Many plugins (`datasette-upload-csvs`, `datasette-litestream`, `datasette-socrata`) perform tasks that run for a really long time, and want to give continue status updates to the user. They can store this info inside`_internal` Safer authentication. Passwords and authentication plugins usually store credentials/hashed secrets in configuration files or environment variables, which can be difficult to handle. Now, they can store them in `_internal` Proposal We remove `_internal` from `datasette.databases` property. We add new `datasette.get_internal_db()` method that returns the `_internal` database, for plugins to use We add a new `--internal internal.db` flag. If provided, then the `_internal` DB will be sourced from that file, and further updates will be persisted to that file (instead of an in-memory database) When creating internal.db, create a new `_datasette_internal` table to mark it a an "datasette internal database" In `datasette serve`, we check for the existence of the `_datasette_internal` table. If it exists, we assume the user provided that file in error and raise an error. This is to limit the chance that someone accidentally publishes their internal database to the internet. We could optionally add a `--unsafe-allow-internal` flag (or database plugin) that allows someone to do this if they really want to. New features unlocked with this These features don't really need a standardized `_internal` table per-say (plugins could currently configure their own long-time storage features if they really wanted to), but it would make it much simpler to create these kinds of features with a persistent application database. `datasette-comments` : A plugin for commenting on rows or specific values in a database. Comment contents + threads + email notification info can be stored in `_internal` Bookmarks: "Bookmarking" an SQL query could be stored in `_internal`, or a URL link shortener Webhooks: If a plugin wants to either consume a webhook or create a new one, they can store hashed credentials/API endpoints in `_internal`	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2157/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1900026059	I_kwDOBm6k_c5xQBjL	2188	Plugin Hooks for "compile to SQL" languages	asg017 15178711	open		2	2023-09-18T01:37:15Z	2023-09-18T06:58:53Z	CONTRIBUTOR	There's a ton of tools/languages that compile to SQL, which may be nice in Datasette. Some examples: Logica https://logica.dev PRQL https://prql-lang.org Malloy, but not sure if it works with SQLite? https://github.com/malloydata/malloy It would be cool if plugins could extend Datasette to use these languages, in both the code editor and API usage. A few things I'd imagine a `datasette-prql` or `datasette-logica` plugin would do: `prql=` instead of `sql=` Code editor support (syntax highlighting, autocomplete) Hide/show SQL	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2188/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
1994857251	I_kwDOBm6k_c525xsj	2208	No suggested facets when a column named 'value' is included	rgieseke 198537	open		1	2023-11-15T14:11:17Z	2023-11-15T14:18:59Z	CONTRIBUTOR	When a column named 'value' is included there are no suggested facets is shown as the query uses an alias of 'value'. https://github.com/simonw/datasette/blob/452a587e236ef642cbc6ae345b58767ea8420cb5/datasette/facets.py#L168-L174 Currently the following is shown (from https://latest.datasette.io/fixtures/facetable) When I add a column named 'value' only the JSON facets are processed. I think that not using aliases could be a solution (except if someone wants to use a column named `count(*)` though this seems to be unlikely). I'll open a PR with that. There is also a TODO with a similar question in the same file. I have not looked into that yet. https://github.com/simonw/datasette/blob/452a587e236ef642cbc6ae345b58767ea8420cb5/datasette/facets.py#L512	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2208/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
2028698018	I_kwDOBm6k_c5463mi	2213	feature request: gzip compression of database downloads	fgregg 536941	open		1	2023-12-06T14:35:03Z	2023-12-06T15:05:46Z	CONTRIBUTOR	At the bottom of database pages, datasette gives users the opportunity to download the underlying sqlite database. It would be great if that could be served gzip compressed. this is similar to #1213, but for me, i don't need datasette to compress html and json because my CDN layer does it for me, however, cloudflare at least, will not compress a mimetype of "application" (see list of mimetype: https://developers.cloudflare.com/speed/optimization/content/brotli/content-compression/)	datasette 107914493	issue	{ "url": "https://api.github.com/repos/simonw/datasette/issues/2213/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
488874815	MDU6SXNzdWU0ODg4NzQ4MTU=	5	Write tests that simulate the Twitter API	simonw 9599	open		1	2019-09-03T23:55:35Z	2019-09-03T23:56:28Z	MEMBER	I can use betamax for this: https://pypi.org/project/betamax/	twitter-to-sqlite 206156866	issue	{ "url": "https://api.github.com/repos/dogsheep/twitter-to-sqlite/issues/5/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
496415321	MDU6SXNzdWU0OTY0MTUzMjE=	1	Figure out some interesting example SQL queries	simonw 9599	open		9	2019-09-20T15:28:07Z	2021-05-03T03:46:23Z	MEMBER	My knowledge of genetics has left me short here. I'd love to be able to provide some interesting example SELECT queries - maybe one that spots if you are likely to have red hair?	genome-to-sqlite 209590345	issue	{ "url": "https://api.github.com/repos/dogsheep/genome-to-sqlite/issues/1/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
503243784	MDU6SXNzdWU1MDMyNDM3ODQ=	3	Extract images into separate tables	simonw 9599	open		1	2019-10-07T05:43:01Z	2020-09-01T06:17:45Z	MEMBER	As already done with authors. Slightly harder because images do not have a universally unique ID. Also need to figure out what to do about there being columns for both `image` and `images`.	pocket-to-sqlite 213286752	issue	{ "url": "https://api.github.com/repos/dogsheep/pocket-to-sqlite/issues/3/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
505673645	MDU6SXNzdWU1MDU2NzM2NDU=	16	Do a better job with archived direct message threads	simonw 9599	open		0	2019-10-11T06:55:21Z	2019-10-11T06:55:27Z	MEMBER	https://github.com/dogsheep/twitter-to-sqlite/blob/fb2698086d766e0333a55bb73435e7283feeb438/twitter_to_sqlite/archive.py#L98-L99	twitter-to-sqlite 206156866	issue	{ "url": "https://api.github.com/repos/dogsheep/twitter-to-sqlite/issues/16/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
530491074	MDU6SXNzdWU1MzA0OTEwNzQ=	14	Command for importing events	simonw 9599	open		3	2019-11-29T21:28:58Z	2020-04-14T19:38:34Z	MEMBER	Eg from https://api.github.com/users/simonw/events Docs here: https://developer.github.com/v3/activity/events/#list-events-performed-by-a-user	github-to-sqlite 207052882	issue	{ "url": "https://api.github.com/repos/dogsheep/github-to-sqlite/issues/14/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
599776345	MDU6SXNzdWU1OTk3NzYzNDU=	24	Feature idea: github-to-sqlite everything ...	simonw 9599	open		0	2020-04-14T18:34:00Z	2020-04-14T18:34:00Z	MEMBER	At the moment if you want to pull all your repos, issues, issues comments etc you have to do it with a sequence of separate commands. Consider adding a `everything` or `all` command which fetches everything that the tool knows how to fetch, and is designed to be run on a cron in a way that fetches just new stuff each time.	github-to-sqlite 207052882	issue	{ "url": "https://api.github.com/repos/dogsheep/github-to-sqlite/issues/24/reactions", "total_count": 7, "+1": 7, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
602533300	MDU6SXNzdWU2MDI1MzMzMDA=	1	Import photo metadata from Apple Photos into SQLite	simonw 9599	open	Apple Photos online and securely browsable 5324096	8	2020-04-18T19:23:26Z	2020-05-04T02:41:40Z	MEMBER	Faces, albums, locations, that kind of thing.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/1/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
602533481	MDU6SXNzdWU2MDI1MzM0ODE=	3	Import EXIF data into SQLite - lens used, ISO, aperture etc	simonw 9599	open	Apple Photos online and securely browsable 5324096	2	2020-04-18T19:24:31Z	2021-10-05T12:38:24Z	MEMBER		dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/3/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
602585497	MDU6SXNzdWU2MDI1ODU0OTc=	7	Integrate image content hashing	simonw 9599	open		2	2020-04-19T00:36:58Z	2021-08-26T02:01:01Z	MEMBER	To spot duplicate images (where the file content differs such that the sha256 is no longer a match) it would be useful to calculate and store perceptual hashes of some sort.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/7/reactions", "total_count": 1, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 1, "rocket": 0, "eyes": 0 }
602619330	MDU6SXNzdWU2MDI2MTkzMzA=	45	Use raise_for_status() everywhere	simonw 9599	open		1	2020-04-19T04:38:28Z	2020-04-19T04:39:22Z	MEMBER	I keep seeing errors which I think are caused by authentication or rate limit problems but which appear to be unexpected JSON responses - presumably because they are actually an error message. Recent example: https://github.com/simonw/jsk-fellows-on-twitter/runs/598892575 Using `response.raise_for_status()` everywhere will make these errors less confusing.	twitter-to-sqlite 206156866	issue	{ "url": "https://api.github.com/repos/dogsheep/twitter-to-sqlite/issues/45/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
606033104	MDU6SXNzdWU2MDYwMzMxMDQ=	12	If less than 500MB, show size in MB not GB	simonw 9599	open		1	2020-04-24T04:35:01Z	2020-04-24T04:35:25Z	MEMBER	Just saw this: `Uploading 0.05 GB`	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/12/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
607888367	MDU6SXNzdWU2MDc4ODgzNjc=	13	Also upload movie files	simonw 9599	open		2	2020-04-27T22:11:25Z	2020-04-28T00:39:45Z	MEMBER	The `upload` command currently only handles static images: https://github.com/dogsheep/photos-to-sqlite/blob/d939455af00e07866686457ee2fcb9b2d1b7194e/photos_to_sqlite/utils.py#L26-L33 Need to cover movies taken by my phone and DSLR too.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/13/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
608512747	MDU6SXNzdWU2MDg1MTI3NDc=	14	Annotate photos using the Google Cloud Vision API	simonw 9599	open		5	2020-04-28T18:09:03Z	2020-04-28T18:19:06Z	MEMBER	It can detect faces, run OCR, do image labeling (it knows what a lemur is!) and do object localization where it identifies objects and returns bounding polygons for them.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/14/reactions", "total_count": 3, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 1, "rocket": 0, "eyes": 0 }
612287234	MDU6SXNzdWU2MTIyODcyMzQ=	16	Import machine-learning detected labels (dog, llama etc) from Apple Photos	simonw 9599	open		13	2020-05-05T02:45:43Z	2020-05-05T05:38:16Z	MEMBER	Follow-on from #1. Apple Photos runs some very sophisticated machine learning on-device to figure out if photos are of dogs, llamas and so on. I really want to extract those labels out into my own database.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/16/reactions", "total_count": 2, "+1": 0, "-1": 0, "laugh": 1, "hooray": 1, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
612860758	MDU6SXNzdWU2MTI4NjA3NTg=	18	Switch CI solution to GitHub Actions with a macOS runner	simonw 9599	open		1	2020-05-05T20:03:50Z	2020-05-05T23:49:18Z	MEMBER	Refs #17.	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/18/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
615626118	MDU6SXNzdWU2MTU2MjYxMTg=	22	Try out ExifReader	simonw 9599	open		4	2020-05-11T06:32:13Z	2020-05-14T05:59:53Z	MEMBER	https://pypi.org/project/ExifReader/ New fork that should be able to handle EXIF in HEIC files. Forked here: https://github.com/ianare/exif-py/issues/102#issuecomment-626376522 Refs #3	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/22/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
621323348	MDU6SXNzdWU2MjEzMjMzNDg=	24	Configurable URL for images	simonw 9599	open		1	2020-05-19T22:25:56Z	2020-05-20T06:00:29Z	MEMBER	This is hard-coded at the moment, which is bad: https://github.com/dogsheep/photos-to-sqlite/blob/d5d69b9019703c47bc251444838578dd752801e2/photos_to_sqlite/cli.py#L269-L272	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/24/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
621486115	MDU6SXNzdWU2MjE0ODYxMTU=	27	photos_with_apple_metadata view should include labels	simonw 9599	open		0	2020-05-20T06:06:17Z	2020-05-20T06:06:17Z	MEMBER	https://dogsheep-photos.dogsheep.net/public/photos_with_apple_metadata?place_city=New+Orleans&_facet=place_city&_facet_array=albums&_facet_array=persons Here's one way to add that: `sql select rowid, photo, ( select json_group_array( json_object( 'label', normalized_string, 'href', '/photos/labelled?_hide_sql=1&label=' \|\| normalized_string ) ) from labels where labels.uuid = photos_with_apple_metadata.uuid ) as labels, date,`	dogsheep-photos 256834907	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-photos/issues/27/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
673602857	MDU6SXNzdWU2NzM2MDI4NTc=	9	Define a view that displays photos correctly	simonw 9599	open		0	2020-08-05T14:53:39Z	2020-08-05T14:53:39Z	MEMBER	The `photos` table stores data like this: id \| createdAt \| source \| prefix \| suffix \| width \| height \| visibility \| created ▲ \| user -- \| -- \| -- \| -- \| -- \| -- \| -- \| -- \| -- \| -- 5e12c9708506bc000840262a \| January 06, 2020 - 05:45:20 UTC \| Swarm for iOS 1 \| https://fastly.4sqi.net/img/general/ \| /15889193_AXxGk4I1nbzUZuyYqObgbXdJNyEHiwj6AUDq0tPZWtw.jpg \| 1920 \| 1440 \| public \| 2020-01-06T05:45:20 \| 15889193 The photo URL can be derived from those pieces - define a SQL view which does that (using `datasette-json-html` to display the pictures)	swarm-to-sqlite 205429375	issue	{ "url": "https://api.github.com/repos/dogsheep/swarm-to-sqlite/issues/9/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
689848827	MDU6SXNzdWU2ODk4NDg4Mjc=	6	ISO timestamps	simonw 9599	open		0	2020-09-01T06:16:42Z	2020-09-01T06:16:42Z	MEMBER	The `time_added`, `time_updated` and `time_read` columns currently store data like this: `September 19, 2019 - 00:30:30 UTC` Should use ISO instead, e.g. `2020-07-26T01:05:24+00:00`	pocket-to-sqlite 213286752	issue	{ "url": "https://api.github.com/repos/dogsheep/pocket-to-sqlite/issues/6/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
689850810	MDU6SXNzdWU2ODk4NTA4MTA=	6	Set up a demo instance	simonw 9599	open		0	2020-09-01T06:20:24Z	2020-09-01T06:20:24Z	MEMBER	Once I've got the Datasette plugin to a state where it's worth building a demo: #3 I can use data from my public https://github-to-sqlite.dogsheep.net/ demo plus the Pocket data subset I use for the demo in https://github.com/dogsheep/pocket-to-sqlite/issues/5 - I could pull in the https://dogsheep-photos.dogsheep.net/ photos data too.	dogsheep-beta 197431109	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-beta/issues/6/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
692202408	MDU6SXNzdWU2OTIyMDI0MDg=	12	Idea: maps and GeoJSON support	simonw 9599	open		0	2020-09-03T18:47:10Z	2020-09-04T01:45:03Z	MEMBER	It would be cool if the `display_sql` could return a column populated with GeoJSON which would the automatically be displayed on a map in the results (or maybe default JS would look for a `class="geojson"` element output by the `display` template) - ala https://github.com/simonw/datasette-leaflet-geojson Then I could render workout routes on a map, or Swarm checkin points.	dogsheep-beta 197431109	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-beta/issues/12/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
694136490	MDU6SXNzdWU2OTQxMzY0OTA=	15	Add a bunch of config examples	simonw 9599	open		1	2020-09-05T17:58:43Z	2020-09-18T23:17:39Z	MEMBER	I can bring these over from my personal Dogsheep.	dogsheep-beta 197431109	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-beta/issues/15/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }
694493566	MDU6SXNzdWU2OTQ0OTM1NjY=	16	Timeline view	simonw 9599	open		3	2020-09-06T19:13:58Z	2020-09-21T02:42:29Z	MEMBER	Ability to browse (and facet) by date.	dogsheep-beta 197431109	issue	{ "url": "https://api.github.com/repos/dogsheep/dogsheep-beta/issues/16/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }

Advanced export

JSON shape: default, array, newline-delimited, object

CREATE TABLE [issues] (
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [number] INTEGER,
   [title] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [state] TEXT,
   [locked] INTEGER,
   [assignee] INTEGER REFERENCES [users]([id]),
   [milestone] INTEGER REFERENCES [milestones]([id]),
   [comments] INTEGER,
   [created_at] TEXT,
   [updated_at] TEXT,
   [closed_at] TEXT,
   [author_association] TEXT,
   [pull_request] TEXT,
   [body] TEXT,
   [repo] INTEGER REFERENCES [repos]([id]),
   [type] TEXT
, [active_lock_reason] TEXT, [performed_via_github_app] TEXT, [reactions] TEXT, [draft] INTEGER, [state_reason] TEXT);
CREATE INDEX [idx_issues_repo]
                ON [issues] ([repo]);
CREATE INDEX [idx_issues_milestone]
                ON [issues] ([milestone]);
CREATE INDEX [idx_issues_assignee]
                ON [issues] ([assignee]);
CREATE INDEX [idx_issues_user]
                ON [issues] ([user]);

issues

691 rows where state = "open" and type = "issue" sorted by author_association

1. Data seems to have moved to a `data/` subdirectory

2. Some schema(s?) have changed

Motivation

Proposal

Resources

Motivation

Proposal

Proposal: Consolidate all "config" into `datasette.toml`

"top level" configuration that are currently CLI flags on `datasette serve`

replaces multiple `--setting` flags

replaces `metadata.json`.

The contents of datasette-metadata.json could be defined in this file instead, but supporting separate files is nice (since those are easy to machine-generate)

plugin-specific

Pros

Cons

Other Notes

Toml

Plugin config will be difficult

Extensions, static, one-off plugins?

Explicitly list with plugins to use?

Automatic Nesting

Possible solutions

Re-vamp the Datasette Python metadata APIs

Add an optional `datasette_metadata` table

Re-vamp `metadata.json` and move non-metadata config to another place

Proposal

New features unlocked with this

Advanced export

issues

691 rows where state = "open" and type = "issue" sorted by author_association

1. Data seems to have moved to a data/ subdirectory

2. Some schema(s?) have changed

Motivation

Proposal

Resources

Motivation

Proposal

Proposal: Consolidate all "config" into datasette.toml

"top level" configuration that are currently CLI flags on datasette serve

replaces multiple --setting flags

replaces metadata.json.

The contents of datasette-metadata.json could be defined in this file instead, but supporting separate files is nice (since those are easy to machine-generate)

plugin-specific

Pros

Cons

Other Notes

Toml

Plugin config will be difficult

Extensions, static, one-off plugins?

Explicitly list with plugins to use?

Automatic Nesting

Possible solutions

Re-vamp the Datasette Python metadata APIs

Add an optional datasette_metadata table

Re-vamp metadata.json and move non-metadata config to another place

Proposal

New features unlocked with this

Advanced export

1. Data seems to have moved to a `data/` subdirectory

Proposal: Consolidate all "config" into `datasette.toml`

"top level" configuration that are currently CLI flags on `datasette serve`

replaces multiple `--setting` flags

replaces `metadata.json`.

Add an optional `datasette_metadata` table

Re-vamp `metadata.json` and move non-metadata config to another place