Title: Add UTF-8 validity checking to schema · Issue #151 · singer-io/singer-python · GitHub

Description: For data-type "string", the _transform function just attempts to do str(data) and catches an exception to determine if the string is valid. Binary strings with null bytes or other invalid UTF-8 character sequences will pass through this ...

Opengraph URL: https://github.com/singer-io/singer-python/issues/151

JSON-LD data:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Add UTF-8 validity checking to schema","articleBody":"For data-type `\"string\"`, the `_transform` function just attempts to do `str(data)` and catches an exception to determine if the string is valid. Binary strings with null bytes or other invalid UTF-8 character sequences will pass through this function as valid strings. However, targets may expect strings to be valid encoded text, such as UTF-8.\r\n\r\nUTF-8 encoding validation can be enforced with a pre_hook when calling transform, but this doesn't inform the target about the type of string. It'd be helpful to somehow include character encoding as part of the schema so that downstream targets can know what to expect and choose the appropriate data type. For example, MySQL has `TEXT` and `BLOB` types to separately handle text and binary strings. One natural place to put this could be the `\"format\"` parameter, though it'd be tedious to have to explicitly specify UTF-8 for every string when that is the default. It'd be convenient to have a way to make UTF-8 the default for all strings in a schema and override it with binary (the current behavior) explicitly for binary fields.","author":{"url":"https://github.com/KBorders01","@type":"Person","name":"KBorders01"},"datePublished":"2021-09-08T13:42:27.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/singer-io/singer-python/issues/151"}
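
The pre_hook approach and the proposed "format" override described in the issue body could look roughly like the sketch below. This is a minimal illustration, not singer-python's code: the hook signature (data, typ, schema), the format value "binary", and all function names are assumptions made for the example. Rejecting NUL characters is a policy choice that follows the issue's wording; NUL is technically valid UTF-8, so it is exposed as a flag.

```python
# Hypothetical format value marking a binary field; the issue proposes the
# idea but does not define a concrete name.
BINARY_FORMAT = "binary"


def is_valid_utf8_text(value, allow_nul=False):
    """Return True if value is str or bytes that is clean UTF-8 text."""
    if isinstance(value, bytes):
        try:
            value = value.decode("utf-8")
        except UnicodeDecodeError:
            return False
    elif not isinstance(value, str):
        return False
    if not allow_nul and "\x00" in value:
        return False
    try:
        # A str holding lone surrogates cannot be encoded as UTF-8.
        value.encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


def utf8_pre_hook(data, typ, schema):
    """Validate string values as UTF-8 unless the schema marks them binary.

    Assumed hook signature: (data, typ, schema), returning the value to use.
    """
    if typ != "string" or not isinstance(data, (str, bytes)):
        return data
    if schema.get("format") == BINARY_FORMAT:
        # Explicit override: keep the current pass-through behavior.
        return data
    if not is_valid_utf8_text(data):
        raise ValueError("string value is not valid UTF-8 text")
    return data.decode("utf-8") if isinstance(data, bytes) else data


def mysql_column_type(schema):
    """Target-side choice between TEXT and BLOB based on the same marker."""
    return "BLOB" if schema.get("format") == BINARY_FORMAT else "TEXT"
```

A hook like this would be supplied as the pre_hook when calling transform, as the issue mentions; the exact signature should be checked against the installed singer-python version. Making UTF-8 the default and "binary" the explicit override keeps schemas short, which is the convenience the issue asks for.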

Links:

Add UTF-8 validity checking to schema https://patch-diff.githubusercontent.com/singer-io/singer-python/issues/151#top
KBorders01 https://github.com/KBorders01
on Sep 8, 2021 https://github.com/singer-io/singer-python/issues/151#issue-991154828