home / github

Menu
  • Search all tables
  • GraphQL API

issue_comments

Table actions
  • GraphQL API for issue_comments

5 rows where issue = 711649325 sorted by updated_at descending

✖
✖

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: created_at (date), updated_at (date)

user 1

  • simonw 5

issue 1

  • Better handling of encodings other than utf-8 for "sqlite-utils insert" · 5 ✖

author_association 1

  • OWNER 5
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions issue performed_via_github_app
710258736 https://github.com/simonw/sqlite-utils/issues/182#issuecomment-710258736 https://api.github.com/repos/simonw/sqlite-utils/issues/182 MDEyOklzc3VlQ29tbWVudDcxMDI1ODczNg== simonw 9599 2020-10-16T17:20:41Z 2020-10-16T17:20:41Z OWNER

Documentation: https://sqlite-utils.readthedocs.io/en/latest/cli.html#inserting-csv-or-tsv-data

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
Better handling of encodings other than utf-8 for "sqlite-utils insert" 711649325  
710178871 https://github.com/simonw/sqlite-utils/issues/182#issuecomment-710178871 https://api.github.com/repos/simonw/sqlite-utils/issues/182 MDEyOklzc3VlQ29tbWVudDcxMDE3ODg3MQ== simonw 9599 2020-10-16T16:27:39Z 2020-10-16T16:28:14Z OWNER

The file is opened for me by click.File(), which also handles things like - for stdin. But i neee to be able to switch the encoding used to read from that based on the --encoding option.

I think the way to do that is to open the file in binary mode and then wrap it in a codec reader: python fp = codecs.getreader(encoding)(fp)

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
Better handling of encodings other than utf-8 for "sqlite-utils insert" 711649325  
708695761 https://github.com/simonw/sqlite-utils/issues/182#issuecomment-708695761 https://api.github.com/repos/simonw/sqlite-utils/issues/182 MDEyOklzc3VlQ29tbWVudDcwODY5NTc2MQ== simonw 9599 2020-10-14T22:33:55Z 2020-10-14T22:35:17Z OWNER

For the moment I'm going to add a --encoding option and some code that catches UnicodeDecodeError and shows an error message that suggests using --encoding.

That error message could detect if the file command is available and, if it is, suggest running file filename.txt to detect the character encoding.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
Better handling of encodings other than utf-8 for "sqlite-utils insert" 711649325  
708693133 https://github.com/simonw/sqlite-utils/issues/182#issuecomment-708693133 https://api.github.com/repos/simonw/sqlite-utils/issues/182 MDEyOklzc3VlQ29tbWVudDcwODY5MzEzMw== simonw 9599 2020-10-14T22:26:26Z 2020-10-14T22:26:26Z OWNER

I could use https://github.com/chardet/chardet to help here, though I'd rather not add it as a dependency (sqlite-utils has very few dependencies at the moment). I could add it as an optional dependency though.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
Better handling of encodings other than utf-8 for "sqlite-utils insert" 711649325  
701626134 https://github.com/simonw/sqlite-utils/issues/182#issuecomment-701626134 https://api.github.com/repos/simonw/sqlite-utils/issues/182 MDEyOklzc3VlQ29tbWVudDcwMTYyNjEzNA== simonw 9599 2020-09-30T20:27:09Z 2020-09-30T20:27:42Z OWNER

It looks like http://maps.natalian.org/data.txt is encoded as latin-1, but sqlite-utils assumes utf-8 and hence breaks.

It would be worth improving the error message here. I could also add a --encoding latin-1 option to sqlite-utils insert to help in consuming files that are stored in charsets other than utf-8.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
Better handling of encodings other than utf-8 for "sqlite-utils insert" 711649325  

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
, [performed_via_github_app] TEXT);
CREATE INDEX [idx_issue_comments_issue]
                ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
                ON [issue_comments] ([user]);
Powered by Datasette · Queries took 24.896ms · About: github-to-sqlite
  • Sort ascending
  • Sort descending
  • Facet by this
  • Hide this column
  • Show all columns
  • Show not-blank rows