Title: Some of the step tasks have been OOM Killed. · Issue #189 · modAL-python/modAL · GitHub
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Some of the step tasks have been OOM Killed.","articleBody":"I am facing \"oom_kill event in StepId=866679.batch. Some of the step tasks have been OOM Killed.\" while using avg_confidence strategy for my multilabel dataset with around 38000 images of size 224. I use torch Dataloader with batch size 8 to load the data. Here's a snippet of the code covering Active Learning loop -\r\n\r\nn_queries = 14\r\nfor i in range(n_queries):\r\n if i == 0:\r\n n_instances = 8\r\n else:\r\n power += 0.25\r\n n_instances = batch(int(np.ceil(np.power(10, power))), batch_size)\r\n total_samples += n_instances\r\n n_instances_list.append(total_samples)\r\n \r\n print(f\"\\nQuery {i + 1}: Requesting {n_instances} samples.\")\r\n print(f\"Number of samples in pool before query: {X_pool.shape[0]}\")\r\n\r\n \r\n\r\n with torch.device(\"cpu\"):\r\n query_idx, _ = learner.query(X_pool, n_instances=n_instances) \r\n query_idx = np.unique(query_idx)\r\n query_idx = np.array(query_idx).flatten() \r\n\r\n # Extract the samples based on the query indices\r\n X_query = X_pool[query_idx]\r\n y_query = y_pool[query_idx]\r\n filenames_query = [filenames_pool[idx] for idx in query_idx]\r\n\r\n print(\"Shape of X_query after indexing:\", X_query.shape)\r\n\r\n if X_query.ndim != 4:\r\n raise ValueError(f\"Unexpected number of dimensions in X_query: {X_query.ndim}\")\r\n if X_query.shape[1:] != (224, 224, 3):\r\n raise ValueError(f\"Unexpected shape in X_query dimensions: {X_query.shape}\")\r\n\r\n X_cumulative = np.vstack((X_cumulative, X_query))\r\n y_cumulative = np.vstack((y_cumulative, y_query))\r\n filenames_cumulative.extend(filenames_query)\r\n\r\n save_checkpoint(i + 1, X_cumulative, y_cumulative, filenames_cumulative, save_dir)\r\n\r\n learner.teach(X=X_cumulative, y=y_cumulative)\r\n\r\n y_pred = learner.predict(X_test_np)\r\n accuracy = accuracy_score(y_test_np, y_pred)\r\n f1 = f1_score(y_test_np, y_pred, average='macro')\r\n acc_test_data.append(accuracy)\r\n f1_test_data.append(f1)\r\n\r\n print(f\"Accuracy after query {i + 1}: {accuracy}\")\r\n print(f\"F1 Score after query {i + 1}: {f1}\")\r\n\r\n\r\n # Early stopping check\r\n if f1 \u003e best_f1_score:\r\n best_f1_score = f1\r\n wait = 0 # reset the wait counter\r\n else:\r\n wait += 1 # increment the wait counter\r\n if wait \u003e= patience:\r\n print(\"Stopping early due to no improvement in F1 score.\")\r\n break\r\n\r\n # Remove queried instances from the pool\r\n X_pool = np.delete(X_pool, query_idx, axis=0)\r\n y_pool = np.delete(y_pool, query_idx, axis=0)\r\n filenames_pool = [filename for idx, filename in enumerate(filenames_pool) if idx not in query_idx]\r\n print(f\"Number of samples in pool after query: {X_pool.shape[0]}\")\r\n\r\nThis code runs well till 11 iterations but in the 12th iteration I get the OOM kill error. \r\n\r\nI am using A100 GPU with 40GB RAM which should be sufficient for this loop. Could you please help me identify what could be going wrong which leads to excessive memory requirement. Is there a bottleneck in my code that I should address? Could it be the case that for every iterarion the data is held in the main memory and can it be freed somehow without breaking the code and distorting the results. 
","author":{"url":"https://github.com/shubhamgp47","@type":"Person","name":"shubhamgp47"},"datePublished":"2024-07-27T13:07:40.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/189/modAL/issues/189"}
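The query step is the other large transient: learner.query(X_pool, n_instances=...) hands the entire remaining pool to the strategy in one call, so whatever the estimator predicts for the ~38,000 pool images exists in memory at once. Assuming the acquisition score is computed per sample (which is how confidence-style strategies behave conceptually), the pool can be scored chunk by chunk and only the top-n indices kept. The sketch below is a generic stand-in under that assumption; it does not reproduce modAL's avg_confidence, and score_fn and chunk_size are illustrative.

```python
# Chunked scoring of the pool: only one small batch of predictions is alive at
# a time. Assumes estimator.predict_proba returns an (n_samples, n_classes)
# array; score_fn maps that to one utility value per sample.
import numpy as np

def chunked_query(estimator, X_pool, n_instances, score_fn, chunk_size=512):
    """Return indices of the n_instances highest-scoring pool samples."""
    scores = np.empty(len(X_pool), dtype=np.float64)
    for start in range(0, len(X_pool), chunk_size):
        stop = min(start + chunk_size, len(X_pool))
        proba = estimator.predict_proba(X_pool[start:stop])  # small batch only
        scores[start:stop] = score_fn(proba)
    top = np.argpartition(scores, -n_instances)[-n_instances:]  # top-n, unordered
    return top[np.argsort(scores[top])[::-1]]                   # order best-first

def avg_uncertainty(proba):
    # Illustrative score: higher when the mean class confidence is lower.
    return 1.0 - proba.mean(axis=1)

# Usage in place of learner.query (ActiveLearner exposes the wrapped model as
# learner.estimator):
#   query_idx = chunked_query(learner.estimator, X_pool, n_instances, avg_uncertainty)
```

An alternative that keeps learner.query unchanged is to score only a random subsample of the pool at each iteration instead of all 38,000 images, which bounds the query-time memory in the same way.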