René's URL Explorer Experiment


Title: Saving trained models and their metadata for inference and reproducibility · Issue #41 · PPPLDeepLearning/plasma-python · GitHub

Open Graph Title: Saving trained models and their metadata for inference and reproducibility · Issue #41 · PPPLDeepLearning/plasma-python

X Title: Saving trained models and their metadata for inference and reproducibility · Issue #41 · PPPLDeepLearning/plasma-python

Description: Following discussion on Wednesday 2019-12-04 in FRNN group meeting in San Diego, we need to start systematically saving the best trained models for: Collaboration (no need for multiple users to waste GPU hours retraining the same models)...

Open Graph Description: Following discussion on Wednesday 2019-12-04 in FRNN group meeting in San Diego, we need to start systematically saving the best trained models for: Collaboration (no need for multiple users to was...

X Description: Following discussion on Wednesday 2019-12-04 in FRNN group meeting in San Diego, we need to start systematically saving the best trained models for: Collaboration (no need for multiple users to was...

Opengraph URL: https://github.com/PPPLDeepLearning/plasma-python/issues/41

X: @github

direct link

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Saving trained models and their metadata for inference and reproducibility","articleBody":"Following discussion on Wednesday 2019-12-04 in FRNN group meeting in San Diego, we need to start systematically saving the best trained models for:\r\n1. Collaboration (no need for multiple users to waste GPU hours retraining the same models)\r\n2. Practical inference (@mdboyer wants a Python interface derived from `performance_analysis.py` that would allow a user to load a trained model and easily feed a set of shot(s) for inference, without using the bloated shot list and preprocessing pipeline that has been oriented towards training for the first phase of the project. Would enable exploratory studies about proximity to disruption, UQ, clustering, etc. This is an important intermediate step to setting up the C-based real-time inference tool in the PCS. ) \r\n3. Reproducibility\r\n\r\nAs a part of a broader effort towards improving reproducibility of our workflow, these models should be stored with:\r\n- `.h5` file containing the tunable parameters (can be directly loaded by Keras or C-translated inference software)\r\n- Input configuration `conf.yaml` and/or dumped final configuration used in specifying and training the model\r\n- Output performance metrics of the trained model (train/validate/test ROC)\r\n- Normalization `.npz` pickled class. For `VarNormalizer`, this would only consist of the standard deviations of each channel of each signal from the set of shots used to train the normalizer. However, it is serialized and saved as a \"fat\" class object that requires the entire `plasma` module to load. Might want to dump a simple non-pickled array, or even `.txt`, alongside the pickle, so that we have a simple file to load with the Keras-C wrapper.\r\n- Some metadata about the layout of a preprocessed shot in `processed_shots/signal_group_*/*.npz` (order of channels and signals, sampling rates, thresholding? etc.), so that any real-time inference wrapper could apply a similar preprocessing to the incoming data.\r\n- **Exact** individual shot numbers used in the training, validation, and testing sets, so that anyone using the model for inference will know if the shot being supplied to the model has already been used to train the model. \r\n- SHA1 of Git commit \r\n- Conda environment; versions of dependencies such as TensorFlow, Keras, PyTorch, scikit-learn\r\n- Computer used for training, MPI library, CuDNN library, etc.\r\n-  Number of devices and MPI ranks used in training (least important)\r\n\r\nGiven the binary `.h5` and `.npz` files, we probably don't want to use VCS to store everything. But we might want to version control the plain-text metadata about the trained models. Store in this repository alongside the code? Or a new repository under our GitHub organization?\r\n\r\nAlso, should we consider ONNX?\r\n\r\n\r\n","author":{"url":"https://github.com/felker","@type":"Person","name":"felker"},"datePublished":"2019-12-05T19:00:54.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":1},"url":"https://github.com/41/plasma-python/issues/41"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:c3356a2e-ad32-e147-ca2b-ca39c9675d7c
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-id87F2:10383D:6229BB:7F7341:698E8E6B
html-safe-nonce973d206bf162855d50b7158db33f69dcb8d8402bd969b7485ec59e6f9fd37a7d
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4N0YyOjEwMzgzRDo2MjI5QkI6N0Y3MzQxOjY5OEU4RTZCIiwidmlzaXRvcl9pZCI6IjYzNjM0NjQxNjUwMzAyNjg1MjMiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmac2da5c40f6fb72c8a17a69a4066d320e239dddf8f48445d384011c31b9d8f656e
hovercard-subject-tagissue:533535047
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/PPPLDeepLearning/plasma-python/41/issue_layout
twitter:imagehttps://opengraph.githubassets.com/28cf95e5167c565e9621192571ae8c6d68f6ac931a4ff9bab0c16528d7a5f63a/PPPLDeepLearning/plasma-python/issues/41
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/28cf95e5167c565e9621192571ae8c6d68f6ac931a4ff9bab0c16528d7a5f63a/PPPLDeepLearning/plasma-python/issues/41
og:image:altFollowing discussion on Wednesday 2019-12-04 in FRNN group meeting in San Diego, we need to start systematically saving the best trained models for: Collaboration (no need for multiple users to was...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernamefelker
hostnamegithub.com
expected-hostnamegithub.com
Nonecb2828a801ee6b7be618f3ac76fbf55def35bbc30f053a9c41bf90210b8b72ba
turbo-cache-controlno-preview
go-importgithub.com/PPPLDeepLearning/plasma-python git https://github.com/PPPLDeepLearning/plasma-python.git
octolytics-dimension-user_id23219101
octolytics-dimension-user_loginPPPLDeepLearning
octolytics-dimension-repository_id72968591
octolytics-dimension-repository_nwoPPPLDeepLearning/plasma-python
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id72968591
octolytics-dimension-repository_network_root_nwoPPPLDeepLearning/plasma-python
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
releasee6b91a7e6e46287d26887e3fb7a4161657bab8f7
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/PPPLDeepLearning/plasma-python/issues/41#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FPPPLDeepLearning%2Fplasma-python%2Fissues%2F41
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FPPPLDeepLearning%2Fplasma-python%2Fissues%2F41
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=PPPLDeepLearning%2Fplasma-python
Reloadhttps://github.com/PPPLDeepLearning/plasma-python/issues/41
Reloadhttps://github.com/PPPLDeepLearning/plasma-python/issues/41
Reloadhttps://github.com/PPPLDeepLearning/plasma-python/issues/41
PPPLDeepLearning https://github.com/PPPLDeepLearning
plasma-pythonhttps://github.com/PPPLDeepLearning/plasma-python
Notifications https://github.com/login?return_to=%2FPPPLDeepLearning%2Fplasma-python
Fork 43 https://github.com/login?return_to=%2FPPPLDeepLearning%2Fplasma-python
Star 88 https://github.com/login?return_to=%2FPPPLDeepLearning%2Fplasma-python
Code https://github.com/PPPLDeepLearning/plasma-python
Issues 21 https://github.com/PPPLDeepLearning/plasma-python/issues
Pull requests 1 https://github.com/PPPLDeepLearning/plasma-python/pulls
Actions https://github.com/PPPLDeepLearning/plasma-python/actions
Projects 0 https://github.com/PPPLDeepLearning/plasma-python/projects
Security 0 https://github.com/PPPLDeepLearning/plasma-python/security
Insights https://github.com/PPPLDeepLearning/plasma-python/pulse
Code https://github.com/PPPLDeepLearning/plasma-python
Issues https://github.com/PPPLDeepLearning/plasma-python/issues
Pull requests https://github.com/PPPLDeepLearning/plasma-python/pulls
Actions https://github.com/PPPLDeepLearning/plasma-python/actions
Projects https://github.com/PPPLDeepLearning/plasma-python/projects
Security https://github.com/PPPLDeepLearning/plasma-python/security
Insights https://github.com/PPPLDeepLearning/plasma-python/pulse
New issuehttps://github.com/login?return_to=https://github.com/PPPLDeepLearning/plasma-python/issues/41
New issuehttps://github.com/login?return_to=https://github.com/PPPLDeepLearning/plasma-python/issues/41
Saving trained models and their metadata for inference and reproducibilityhttps://github.com/PPPLDeepLearning/plasma-python/issues/41#top
https://github.com/felker
https://github.com/felker
https://github.com/felker
felkerhttps://github.com/felker
on Dec 5, 2019https://github.com/PPPLDeepLearning/plasma-python/issues/41#issue-533535047
@mdboyerhttps://github.com/mdboyer
felkerhttps://github.com/felker
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.