{"html_url": "https://github.com/simonw/datasette/issues/506#issuecomment-500238035", "issue_url": "https://api.github.com/repos/simonw/datasette/issues/506", "id": 500238035, "node_id": "MDEyOklzc3VlQ29tbWVudDUwMDIzODAzNQ==", "user": {"value": 1059677, "label": "Gagravarr"}, "created_at": "2019-06-09T19:21:18Z", "updated_at": "2019-06-09T19:21:18Z", "author_association": "NONE", "body": "If you don't mind calling out to Java, then Apache Tika is able to tell you what a load of \"binary stuff\" is, plus render it to XHTML where possible.\r\n\r\nThere's a python wrapper around the Apache Tika server, but for a more typical datasette usecase you'd probably just want to grab the Tika CLI jar, and call it with `--detect` and/or `--xhtml` to process the unknown binary blob", "reactions": "{\"total_count\": 0, \"+1\": 0, \"-1\": 0, \"laugh\": 0, \"hooray\": 0, \"confused\": 0, \"heart\": 0, \"rocket\": 0, \"eyes\": 0}", "issue": {"value": 453846217, "label": "Option to display binary data"}, "performed_via_github_app": null}