René's URL Explorer Experiment


Title: GitHub - BigDataInterviewHub/BigDataInterview: Big data interview questions, continuously updated

Description: Big data interview questions, continuously updated. Contribute to BigDataInterviewHub/BigDataInterview development by creating an account on GitHub.

Opengraph URL: https://github.com/BigDataInterviewHub/BigDataInterview

X: @github

Domain: patch-diff.githubusercontent.com

Links:

4 stars https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/stargazers
2 forks https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/forks
7 Commitshttps://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/commits/main/
Cassandra.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md
Cboard_Kylin.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md
Flink.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md
Flume.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md
HBase.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md
Hadoop.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md
Hive.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md
Impala.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md
Oozie.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md
Presto.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md
README.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/README.md
Spark.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md
Sqoop.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md
Storm.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md
Yarn.md https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md
READMEhttps://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#bigdatainterview
https://github.com/JavaInterviewHub/JavaInterview/blob/main/imgs/%E7%A8%8B%E5%BA%8F%E5%91%98%E7%99%BE%E7%A7%91%E5%85%A8%E4%B9%A6.jpg
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#cassandra
1. Explain Cassandra https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#1%E5%90%91cassandra%E8%AE%B2%E8%A7%A3
2. What language is Cassandra written in? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#2cassandra%E7%94%A8%E5%93%AA%E7%A7%8D%E8%AF%AD%E8%A8%80%E5%86%99
3. Who is the original author of Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#3cassandracassandra%E7%9A%84%E5%8E%9F%E5%A7%8B%E4%BD%9C%E8%80%85%E6%98%AF%E8%B0%81
4. Which query language does the Cassandra database use? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#4cassandra%E6%95%B0%E6%8D%AE%E5%BA%93%E4%B8%AD%E4%BD%BF%E7%94%A8%E5%93%AA%E7%A7%8D%E6%9F%A5%E8%AF%A2%E8%AF%AD%E8%A8%80
5. What are the advantages of Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#5cassandra%E7%9A%84%E4%BC%98%E7%82%B9%E4%BC%98%E7%82%B9%E6%98%AF%E4%BB%80%E4%B9%88
6. Can you name some important components of the Cassandra data model? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#6%E6%98%AF%E5%90%A6%E6%8F%90%E5%88%B0%E4%BA%86cassandra%E6%95%B0%E6%8D%AE%E6%A8%A1%E5%9E%8B%E7%9A%84%E4%B8%80%E4%BA%9B%E9%87%8D%E8%A6%81%E7%BB%84%E6%88%90%E9%83%A8%E5%88%86
5. Data Model https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#5%E6%95%B0%E6%8D%AE%E6%A8%A1%E5%9E%8Bdata-model
6. Column https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#6%E5%88%97colunmn
7. Column Family https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#7%E5%88%97%E6%97%8Fcolumn-family
8. Super Column Family https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#8%E8%B6%85%E5%88%97%E6%97%8Fsuper-column-family
9. KeySpaces https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#9keyspaces
10. Clusters https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#10clusters
11. What are the other components of Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#11cassandra%E7%9A%84%E5%85%B6%E4%BB%96%E6%88%90%E5%88%86%E6%98%AF%E4%BB%80%E4%B9%88
12. What are the different composite keys in Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#12cassandra%E4%B8%AD%E6%9C%89%E5%93%AA%E4%BA%9B%E4%B8%8D%E5%90%8C%E7%9A%84%E7%BB%84%E5%90%88%E9%94%AE
13. What is data replication in Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#13%E4%BB%80%E4%B9%88%E6%98%AFcassandra%E4%B8%AD%E7%9A%84%E6%95%B0%E6%8D%AE%E5%A4%8D%E5%88%B6
14. What does a data center mean in Cassandra? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#14cassandra%E4%B8%AD%E7%9A%84%E6%95%B0%E6%8D%AE%E4%B8%AD%E5%BF%83%E6%98%AF%E4%BB%80%E4%B9%88%E6%84%8F%E6%80%9D
15. Which ports does Cassandra use? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#15cassandra%E7%94%A8%E4%BA%86%E5%93%AA%E4%BA%9B%E7%AB%AF%E5%8F%A3
16. Does a single seed mean a single point of failure? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#16%E6%98%AF%E4%B8%8D%E6%98%AF%E5%8D%95%E4%B8%AAseed%E6%84%8F%E5%91%B3%E7%9D%80%E5%8D%95%E7%82%B9%E6%95%85%E9%9A%9C
17. Why can't I invoke a certain JMX method from jconsole? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#17%E4%B8%BA%E4%BB%80%E4%B9%88%E4%B8%8D%E5%8F%AF%E4%BB%A5%E5%9C%A8jconsole%E9%87%8C%E8%B0%83%E7%94%A8%E6%9F%90%E4%B8%AAjmx%E6%96%B9%E6%B3%95%E5%91%A2
18. Why do I see messages like “… messages dropped …” in the log file? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#18%E4%B8%BA%E4%BB%80%E4%B9%88%E6%88%91%E4%BC%9A%E5%9C%A8%E6%97%A5%E5%BF%97%E6%96%87%E4%BB%B6%E9%87%8C%E7%9C%8B%E5%88%B0--messages-dropped-%E8%BF%99%E6%A0%B7%E7%9A%84%E4%BF%A1%E6%81%AF
19. Cassandra died with java.lang.OutOfMemoryError: Map failed https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#19cassandra%E5%9B%A0%E4%B8%BAjavalangoutofmemoryerror-map-failed%E6%8C%82%E6%8E%89%E4%BA%86
20. What happens if two updates occur at the same moment? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#20%E5%A6%82%E6%9E%9C%E5%86%8D%E5%90%8C%E4%B8%80%E6%97%B6%E5%88%BB%E5%8F%91%E7%94%9F%E4%B8%A4%E6%AC%A1%E6%9B%B4%E6%96%B0%E4%BC%9A%E5%8F%91%E7%94%9F%E4%BB%80%E4%B9%88
21. Why do I get a Stream failed error when adding a new node? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#21%E4%B8%BA%E4%BB%80%E4%B9%88%E5%9C%A8%E5%8A%A0%E5%85%A5%E4%B8%80%E4%B8%AA%E6%96%B0%E8%8A%82%E7%82%B9%E7%9A%84%E6%97%B6%E5%80%99%E4%BC%9A%E6%9C%89stream-failed%E9%94%99%E8%AF%AF
Reference links https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cassandra.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#cboard--kylin
1. CBoard https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#1cboard
2. CBoard features https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#2cboard%E7%89%B9%E6%80%A7
3. What are the advantages and disadvantages of Kylin? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#3kylin%E7%9A%84%E4%BC%98%E7%82%B9%E5%92%8C%E7%BC%BA%E7%82%B9
4. How should a Kylin rowkey be designed? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#4kylin%E7%9A%84rowkey%E5%A6%82%E4%BD%95%E8%AE%BE%E8%AE%A1
5. How are Kylin's cuboid, cube, and segment related? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#5kylin%E7%9A%84cuboidcube%E5%92%8Csegment%E7%9A%84%E5%85%B3%E7%B3%BB
6. A Hive wide table has 5 dimensions and I chose 4 of them when building the Kylin cube; how many dimension fields does select * return? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#6%E4%B8%80%E5%BC%A0hive%E5%AE%BD%E8%A1%A8%E6%9C%895%E4%B8%AA%E7%BB%B4%E5%BA%A6kylin%E6%9E%84%E5%BB%BAcube%E7%9A%84%E6%97%B6%E5%80%99%E6%88%91%E9%80%89%E4%BA%864%E4%B8%AA%E7%BB%B4%E5%BA%A6%E6%88%91select-%E7%9A%84%E6%97%B6%E5%80%99%E4%BC%9A%E6%9C%89%E5%87%A0%E4%B8%AA%E7%BB%B4%E5%BA%A6%E5%AD%97%E6%AE%B5
7. Are you familiar with other OLAP tools? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#7%E5%85%B6%E4%BB%96olap%E5%B7%A5%E5%85%B7%E6%9C%89%E4%BA%86%E8%A7%A3%E8%BF%87%E5%90%97
8. How do you usually tune Kylin? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#8kylin%E4%BD%A0%E4%B8%80%E8%88%AC%E6%80%8E%E4%B9%88%E8%B0%83%E4%BC%98
9. How does Kylin work, and how is it optimized? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#9kylin%E7%9A%84%E5%8E%9F%E7%90%86%E5%92%8C%E4%BC%98%E5%8C%96
10. Why is a large number of dimensions discouraged in Kylin? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#10%E4%B8%BA%E4%BB%80%E4%B9%88kylin%E7%9A%84%E7%BB%B4%E5%BA%A6%E4%B8%8D%E5%BB%BA%E8%AE%AE%E8%BF%87%E5%A4%9A
11. What does the Kylin cube build process look like? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#11kylin-cube%E7%9A%84%E6%9E%84%E5%BB%BA%E8%BF%87%E7%A8%8B%E6%98%AF%E6%80%8E%E4%B9%88%E6%A0%B7%E7%9A%84
12. Kylin's build algorithms https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#12kylin%E7%9A%84%E6%9E%84%E5%BB%BA%E7%AE%97%E6%B3%95
13. Cube optimization? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#13cube%E4%BC%98%E5%8C%96
14. What is a full build? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#14%E4%BB%80%E4%B9%88%E5%8F%AB%E5%85%A8%E9%87%8F%E6%9E%84%E5%BB%BA
15. How do you implement automatic incremental builds? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#15%E6%80%8E%E4%B9%88%E6%A0%B7%E5%AE%9E%E7%8E%B0%E8%87%AA%E5%8A%A8%E5%A2%9E%E9%87%8F%E6%9E%84%E5%BB%BA
16. How do you query Kylin data from your own web system? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#16%E6%80%8E%E6%A0%B7%E5%AE%9E%E7%8E%B0%E5%9C%A8%E8%87%AA%E5%B7%B1%E7%9A%84web%E7%B3%BB%E7%BB%9F%E4%B8%AD%E6%9F%A5%E8%AF%A2kylin-%E7%9A%84%E6%95%B0%E6%8D%AE
Reference links https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Cboard_Kylin.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#flink
1. Briefly introduce Flink https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#1%E7%AE%80%E5%8D%95%E4%BB%8B%E7%BB%8D%E4%B8%80%E4%B8%8B-flink
2. How does Flink differ from traditional Spark Streaming? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#2flink-%E7%9B%B8%E6%AF%94%E4%BC%A0%E7%BB%9F%E7%9A%84-spark-streaming-%E6%9C%89%E4%BB%80%E4%B9%88%E5%8C%BA%E5%88%AB
3. Does Flink require Hadoop components to run? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#3flink-%E7%9A%84%E8%BF%90%E8%A1%8C%E5%BF%85%E9%A1%BB%E4%BE%9D%E8%B5%96-hadoop%E7%BB%84%E4%BB%B6%E5%90%97
4. What roles exist in a Flink cluster, and what does each do? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#4flink%E9%9B%86%E7%BE%A4%E6%9C%89%E5%93%AA%E4%BA%9B%E8%A7%92%E8%89%B2%E5%90%84%E8%87%AA%E6%9C%89%E4%BB%80%E4%B9%88%E4%BD%9C%E7%94%A8
5. Explain the Task Slot concept in Flink resource management https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#5%E8%AF%B4%E8%AF%B4-flink-%E8%B5%84%E6%BA%90%E7%AE%A1%E7%90%86%E4%B8%AD-task-slot-%E7%9A%84%E6%A6%82%E5%BF%B5
6. What are Flink's commonly used operators? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#6%E8%AF%B4%E8%AF%B4-flink-%E7%9A%84%E5%B8%B8%E7%94%A8%E7%AE%97%E5%AD%90
7. Which Flink partitioning strategies do you know? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#7%E8%AF%B4%E8%AF%B4%E4%BD%A0%E7%9F%A5%E9%81%93%E7%9A%84flink%E5%88%86%E5%8C%BA%E7%AD%96%E7%95%A5
8. Do you understand Flink parallelism? How is it configured? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#8flink%E7%9A%84%E5%B9%B6%E8%A1%8C%E5%BA%A6%E4%BA%86%E8%A7%A3%E5%90%97flink%E7%9A%84%E5%B9%B6%E8%A1%8C%E5%BA%A6%E8%AE%BE%E7%BD%AE%E6%98%AF%E6%80%8E%E6%A0%B7%E7%9A%84
9. What is the difference between a Slot and parallelism in Flink? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#9flink%E7%9A%84slot%E5%92%8Cparallelism%E6%9C%89%E4%BB%80%E4%B9%88%E5%8C%BA%E5%88%AB
10. Does Flink have restart strategies? Which ones? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#10flink%E6%9C%89%E6%B2%A1%E6%9C%89%E9%87%8D%E5%90%AF%E7%AD%96%E7%95%A5%E8%AF%B4%E8%AF%B4%E6%9C%89%E5%93%AA%E5%87%A0%E7%A7%8D
11. Have you used Flink's distributed cache? How is it used? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#11%E7%94%A8%E8%BF%87flink%E4%B8%AD%E7%9A%84%E5%88%86%E5%B8%83%E5%BC%8F%E7%BC%93%E5%AD%98%E5%90%97%E5%A6%82%E4%BD%95%E4%BD%BF%E7%94%A8
12. Explain broadcast variables in Flink; what should you watch out for when using them? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#12%E8%AF%B4%E8%AF%B4flink%E4%B8%AD%E7%9A%84%E5%B9%BF%E6%92%AD%E5%8F%98%E9%87%8F%E4%BD%BF%E7%94%A8%E6%97%B6%E9%9C%80%E8%A6%81%E6%B3%A8%E6%84%8F%E4%BB%80%E4%B9%88
13. Explain windows in Flink https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#13%E8%AF%B4%E8%AF%B4flink%E4%B8%AD%E7%9A%84%E7%AA%97%E5%8F%A3
14. Explain state storage in Flink https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#14%E8%AF%B4%E8%AF%B4flink%E4%B8%AD%E7%9A%84%E7%8A%B6%E6%80%81%E5%AD%98%E5%82%A8
15. What kinds of time does Flink have? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#15flink-%E4%B8%AD%E7%9A%84%E6%97%B6%E9%97%B4%E6%9C%89%E5%93%AA%E5%87%A0%E7%B1%BB
16. What is a watermark in Flink, and what is it for? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#16flink-%E4%B8%AD%E6%B0%B4%E5%8D%B0%E6%98%AF%E4%BB%80%E4%B9%88%E6%A6%82%E5%BF%B5%E8%B5%B7%E5%88%B0%E4%BB%80%E4%B9%88%E4%BD%9C%E7%94%A8
17. Are you familiar with Flink Table & SQL? What does the TableEnvironment class do? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#17flink-table--sql-%E7%86%9F%E6%82%89%E5%90%97tableenvironment%E8%BF%99%E4%B8%AA%E7%B1%BB%E6%9C%89%E4%BB%80%E4%B9%88%E4%BD%9C%E7%94%A8
18. How is Flink SQL implemented? How is SQL parsing done? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#18flink-sql%E7%9A%84%E5%AE%9E%E7%8E%B0%E5%8E%9F%E7%90%86%E6%98%AF%E4%BB%80%E4%B9%88%E6%98%AF%E5%A6%82%E4%BD%95%E5%AE%9E%E7%8E%B0-sql-%E8%A7%A3%E6%9E%90%E7%9A%84%E5%91%A2
19. How does Flink achieve efficient data exchange? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#19flink%E6%98%AF%E5%A6%82%E4%BD%95%E5%81%9A%E5%88%B0%E9%AB%98%E6%95%88%E7%9A%84%E6%95%B0%E6%8D%AE%E4%BA%A4%E6%8D%A2%E7%9A%84
20. How does Flink handle fault tolerance? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#20flink%E6%98%AF%E5%A6%82%E4%BD%95%E5%81%9A%E5%AE%B9%E9%94%99%E7%9A%84
21. How do Flink's distributed snapshots work? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#21flink-%E5%88%86%E5%B8%83%E5%BC%8F%E5%BF%AB%E7%85%A7%E7%9A%84%E5%8E%9F%E7%90%86%E6%98%AF%E4%BB%80%E4%B9%88
22. How does Flink guarantee exactly-once semantics? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#22flink-%E6%98%AF%E5%A6%82%E4%BD%95%E4%BF%9D%E8%AF%81exactly-once%E8%AF%AD%E4%B9%89%E7%9A%84
23. What is special about Flink's Kafka connector? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#23flink-%E7%9A%84-kafka-%E8%BF%9E%E6%8E%A5%E5%99%A8%E6%9C%89%E4%BB%80%E4%B9%88%E7%89%B9%E5%88%AB%E7%9A%84%E5%9C%B0%E6%96%B9
24. How does Flink manage memory? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#24%E8%AF%B4%E8%AF%B4-flink%E7%9A%84%E5%86%85%E5%AD%98%E7%AE%A1%E7%90%86%E6%98%AF%E5%A6%82%E4%BD%95%E5%81%9A%E7%9A%84
25. How does Flink handle serialization? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#25%E8%AF%B4%E8%AF%B4-flink%E7%9A%84%E5%BA%8F%E5%88%97%E5%8C%96%E5%A6%82%E4%BD%95%E5%81%9A%E7%9A%84
26. A Flink window shows data skew; how would you fix it? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#26flink%E4%B8%AD%E7%9A%84window%E5%87%BA%E7%8E%B0%E4%BA%86%E6%95%B0%E6%8D%AE%E5%80%BE%E6%96%9C%E4%BD%A0%E6%9C%89%E4%BB%80%E4%B9%88%E8%A7%A3%E5%86%B3%E5%8A%9E%E6%B3%95
27. How do you resolve data hotspots when using aggregation functions such as GroupBy, Distinct, and KeyBy in Flink? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#27flink%E4%B8%AD%E5%9C%A8%E4%BD%BF%E7%94%A8%E8%81%9A%E5%90%88%E5%87%BD%E6%95%B0-groupbydistinctkeyby-%E7%AD%89%E5%87%BD%E6%95%B0%E6%97%B6%E5%87%BA%E7%8E%B0%E6%95%B0%E6%8D%AE%E7%83%AD%E7%82%B9%E8%AF%A5%E5%A6%82%E4%BD%95%E8%A7%A3%E5%86%B3
28. A Flink job has high latency; where would you start in fixing it? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#28flink%E4%BB%BB%E5%8A%A1%E5%BB%B6%E8%BF%9F%E9%AB%98%E6%83%B3%E8%A7%A3%E5%86%B3%E8%BF%99%E4%B8%AA%E9%97%AE%E9%A2%98%E4%BD%A0%E4%BC%9A%E5%A6%82%E4%BD%95%E5%85%A5%E6%89%8B
29. How does Flink handle backpressure? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#29flink%E6%98%AF%E5%A6%82%E4%BD%95%E5%A4%84%E7%90%86%E5%8F%8D%E5%8E%8B%E7%9A%84
30. How does Flink's backpressure differ from Storm's? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#30flink%E7%9A%84%E5%8F%8D%E5%8E%8B%E5%92%8Cstrom%E6%9C%89%E5%93%AA%E4%BA%9B%E4%B8%8D%E5%90%8C
31. Do you understand the concept of Operator Chains? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#31operator-chains%E7%AE%97%E5%AD%90%E9%93%BE%E8%BF%99%E4%B8%AA%E6%A6%82%E5%BF%B5%E4%BD%A0%E4%BA%86%E8%A7%A3%E5%90%97
32. Under what conditions does Flink chain operators together into an operator chain? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#32flink%E4%BB%80%E4%B9%88%E6%83%85%E5%86%B5%E4%B8%8B%E6%89%8D%E4%BC%9A%E6%8A%8Aoperator-chain%E5%9C%A8%E4%B8%80%E8%B5%B7%E5%BD%A2%E6%88%90%E7%AE%97%E5%AD%90%E9%93%BE
33. How do you handle dirty data when consuming from Kafka? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#33%E6%B6%88%E8%B4%B9kafka%E6%95%B0%E6%8D%AE%E7%9A%84%E6%97%B6%E5%80%99%E5%A6%82%E4%BD%95%E5%A4%84%E7%90%86%E8%84%8F%E6%95%B0%E6%8D%AE
References https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flink.md#%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#flume
1. What is Flume? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#1%E4%BB%80%E4%B9%88%E6%98%AF-flume
2. What are Flume's characteristics? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#2flume-%E7%89%B9%E7%82%B9
3. Flume components, Put transaction, Take transaction? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#3flume-%E7%BB%84%E6%88%90put-%E4%BA%8B%E7%89%A9task-%E4%BA%8B%E5%8A%A1
4. Flume interceptors? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#4flume-%E6%8B%A6%E6%88%AA%E5%99%A8
5. How do Flume and Kafka differ for log collection, and if collection stops midway, how are the earlier logs tracked? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#5flume-%E5%92%8C-kafka-%E9%87%87%E9%9B%86%E6%97%A5%E5%BF%97%E5%8C%BA%E5%88%AB%E9%87%87%E9%9B%86%E6%97%A5%E5%BF%97%E6%97%B6%E4%B8%AD%E9%97%B4%E5%81%9C%E4%BA%86%E6%80%8E%E4%B9%88%E8%AE%B0%E5%BD%95%E4%B9%8B%E5%89%8D%E7%9A%84%E6%97%A5%E5%BF%97
6. Can Flume lose data during collection? (Loss-prevention mechanisms) https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#6flume-%E9%87%87%E9%9B%86%E6%95%B0%E6%8D%AE%E4%BC%9A%E4%B8%A2%E5%A4%B1%E5%90%97%E9%98%B2%E6%AD%A2%E4%B8%A2%E5%A4%B1%E6%9C%BA%E5%88%B6
7. Flume memory? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#7flume-%E5%86%85%E5%AD%98
8. Flume Channel optimization? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#8flumechannel-%E4%BC%98%E5%8C%96
9. Monitoring Flume data transfer https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#9flume%E6%95%B0%E6%8D%AE%E4%BC%A0%E8%BE%93%E7%9A%84%E7%9B%91%E6%8E%A7%E7%9A%84
10. What are the core methods in developing a Flume interceptor, and what does each do? What are the pros and cons of interceptors? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#10%E6%8F%8F%E8%BF%B0flume%E6%8B%A6%E6%88%AA%E5%99%A8%E5%BC%80%E5%8F%91%E8%BF%87%E7%A8%8B%E4%B8%AD%E7%9A%84%E6%A0%B8%E5%BF%83%E6%96%B9%E6%B3%95%E6%9C%89%E5%93%AA%E5%87%A0%E4%B8%AA%E4%BB%A5%E5%8F%8A%E5%90%84%E8%87%AA%E4%BD%9C%E7%94%A8%E6%98%AF%E4%BB%80%E4%B9%88%E6%8B%A6%E6%88%AA%E5%99%A8%E5%B8%A6%E6%9D%A5%E7%9A%84%E4%BC%98%E7%BC%BA%E7%82%B9%E5%90%84%E6%98%AF%E4%BB%80%E4%B9%88
11. With a Flume memory channel, how do you deal with data loss when Flume crashes? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#11flume-%E7%AE%A1%E9%81%93%E5%86%85%E5%AD%98flume-%E5%AE%95%E6%9C%BA%E4%BA%86%E6%95%B0%E6%8D%AE%E4%B8%A2%E5%A4%B1%E6%80%8E%E4%B9%88%E8%A7%A3%E5%86%B3
12. How do Flume and Kafka differ for log collection, and if collection stops midway, how are the earlier logs tracked? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#12flume-%E5%92%8C-kafka-%E9%87%87%E9%9B%86%E6%97%A5%E5%BF%97%E5%8C%BA%E5%88%AB%E9%87%87%E9%9B%86%E6%97%A5%E5%BF%97%E6%97%B6%E4%B8%AD%E9%97%B4%E5%81%9C%E4%BA%86%E6%80%8E%E4%B9%88%E8%AE%B0%E5%BD%95%E4%B9%8B%E5%89%8D%E7%9A%84%E6%97%A5%E5%BF%97
13. What components does Flume have, and what exactly do source, channel, and sink do? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#13flume-%E6%9C%89%E5%93%AA%E4%BA%9B%E7%BB%84%E4%BB%B6flume-%E7%9A%84-sourcechannelsink-%E5%85%B7%E4%BD%93%E6%98%AF%E5%81%9A%E4%BB%80%E4%B9%88%E7%9A%84
14. What do replicating and multiplexing mean in a Channel Selector? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#14channel-selector%E4%B8%AD%E7%9A%84replicating%E5%92%8Cmultiplexxing%E5%90%84%E6%98%AF%E4%BB%80%E4%B9%88%E5%90%AB%E4%B9%89
15. Custom development to make TailDirSource support real-time collection from recursive directories https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#15%E8%87%AA%E5%AE%9A%E4%B9%89%E5%BC%80%E5%8F%91%E5%AE%9E%E7%8E%B0taildirsource%E6%94%AF%E6%8C%81%E9%80%92%E5%BD%92%E6%96%87%E4%BB%B6%E5%A4%B9%E6%95%B0%E6%8D%AE%E7%9A%84%E5%AE%9E%E6%97%B6%E6%94%B6%E9%9B%86
16. Flume Channels https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#16-flume-%E7%9A%84-channel
17. Do you understand Flume load balancing and failover? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#17%E4%BA%86%E8%A7%A3-flume-%E7%9A%84%E8%B4%9F%E8%BD%BD%E5%9D%87%E8%A1%A1%E5%92%8C%E6%95%85%E9%9A%9C%E8%BD%AC%E7%A7%BB%E5%90%97
18. Flume parameter tuning https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#18flume%E5%8F%82%E6%95%B0%E8%B0%83%E4%BC%98
19. Flume's transaction mechanism https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#19flume%E7%9A%84%E4%BA%8B%E5%8A%A1%E6%9C%BA%E5%88%B6
20.Flume Event 是数据流的基本单元https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#20flume-event-%E6%98%AF%E6%95%B0%E6%8D%AE%E6%B5%81%E7%9A%84%E5%9F%BA%E6%9C%AC%E5%8D%95%E5%85%83
参考链接https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Flume.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#hbase
1.What is HBase?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#1hbase%E6%98%AF%E4%BB%80%E4%B9%88
2.What are HBase's features?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#2hbase-%E7%9A%84%E7%89%B9%E7%82%B9%E6%98%AF%E4%BB%80%E4%B9%88
3.What is the difference between HBase and Hive?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#3hbase-%E5%92%8C-hive-%E7%9A%84%E5%8C%BA%E5%88%AB
4.What scenarios is HBase suited to?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#4hbase-%E9%80%82%E7%94%A8%E4%BA%8E%E6%80%8E%E6%A0%B7%E7%9A%84%E6%83%85%E6%99%AF
5.Describe the design principles for HBase rowKeys.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#5%E6%8F%8F%E8%BF%B0-hbase-%E7%9A%84-rowkey-%E7%9A%84%E8%AE%BE%E8%AE%A1%E5%8E%9F%E5%88%99
6.Describe what scan and get do in HBase and how their implementations are alike and different.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#6%E6%8F%8F%E8%BF%B0-hbase-%E4%B8%AD-scan-%E5%92%8C-get-%E7%9A%84%E5%8A%9F%E8%83%BD%E4%BB%A5%E5%8F%8A%E5%AE%9E%E7%8E%B0%E7%9A%84%E5%BC%82%E5%90%8C
7.Describe the structure of an HBase cell in detail.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#7%E8%AF%B7%E8%AF%A6%E7%BB%86%E6%8F%8F%E8%BF%B0-hbase-%E4%B8%AD%E4%B8%80%E4%B8%AA-cell-%E7%9A%84%E7%BB%93%E6%9E%84
8.Briefly describe what compaction is for in HBase, when it is triggered, what the two types are and how they differ, and which configuration parameters are relevant.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#8%E7%AE%80%E8%BF%B0-hbase-%E4%B8%AD-compact-%E7%94%A8%E9%80%94%E6%98%AF%E4%BB%80%E4%B9%88%E4%BB%80%E4%B9%88%E6%97%B6%E5%80%99%E8%A7%A6%E5%8F%91%E5%88%86%E4%B8%BA%E5%93%AA%E4%B8%A4%E7%A7%8D%E6%9C%89%E4%BB%80%E4%B9%88%E5%8C%BA%E5%88%AB%E6%9C%89%E5%93%AA%E4%BA%9B%E7%9B%B8%E5%85%B3%E9%85%8D%E7%BD%AE%E5%8F%82%E6%95%B0
9.How do you optimize HBase?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#9hbase-%E5%A6%82%E4%BD%95%E4%BC%98%E5%8C%96
10.How do you pre-split regions?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#10region-%E5%A6%82%E4%BD%95%E9%A2%84%E5%BB%BA%E5%88%86%E5%8C%BA
11.How is an HRegionServer crash handled?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#11hregionserver-%E5%AE%95%E6%9C%BA%E5%A6%82%E4%BD%95%E5%A4%84%E7%90%86
12.HBase read and write flow?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#12hbase-%E8%AF%BB%E5%86%99%E6%B5%81%E7%A8%8B
13.What are HBase's internal mechanisms?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#13hbase-%E5%86%85%E9%83%A8%E6%9C%BA%E5%88%B6%E6%98%AF%E4%BB%80%E4%B9%88
14.What is the key focus when designing an HBase data model? How many Column Families are best for one table, and why?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#14hbase-%E5%9C%A8%E8%BF%9B%E8%A1%8C%E6%A8%A1%E5%9E%8B%E8%AE%BE%E8%AE%A1%E6%97%B6%E9%87%8D%E7%82%B9%E5%9C%A8%E4%BB%80%E4%B9%88%E5%9C%B0%E6%96%B9%E4%B8%80%E5%BC%A0%E8%A1%A8%E4%B8%AD%E5%AE%9A%E4%B9%89%E5%A4%9A%E5%B0%91%E4%B8%AA-column-family-%E6%9C%80%E5%90%88%E9%80%82%E4%B8%BA%E4%BB%80%E4%B9%88
15.How can HBase client read and write performance be improved? Give examples.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#15%E5%A6%82%E4%BD%95%E6%8F%90%E9%AB%98-hbase-%E5%AE%A2%E6%88%B7%E7%AB%AF%E7%9A%84%E8%AF%BB%E5%86%99%E6%80%A7%E8%83%BD%E8%AF%B7%E4%B8%BE%E4%BE%8B%E8%AF%B4%E6%98%8E
16.Using a timestamp directly as the row key causes a hotspot on a single region during writes. Why?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#16%E7%9B%B4%E6%8E%A5%E5%B0%86%E6%97%B6%E9%97%B4%E6%88%B3%E4%BD%9C%E4%B8%BA%E8%A1%8C%E5%81%A5%E5%9C%A8%E5%86%99%E5%85%A5%E5%8D%95%E4%B8%AA-region-%E6%97%B6%E5%80%99%E4%BC%9A%E5%8F%91%E7%94%9F%E7%83%AD%E7%82%B9%E9%97%AE%E9%A2%98%E4%B8%BA%E4%BB%80%E4%B9%88%E5%91%A2
17.Describe how to resolve the conflict between regions that are too small and regions that are too large in HBase.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#17%E8%AF%B7%E6%8F%8F%E8%BF%B0%E5%A6%82%E4%BD%95%E8%A7%A3%E5%86%B3-hbase-%E4%B8%AD-region-%E5%A4%AA%E5%B0%8F%E5%92%8C-region-%E5%A4%AA%E5%A4%A7%E5%B8%A6%E6%9D%A5%E7%9A%84%E5%86%B2%E7%AA%81
Reference linkshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/HBase.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#hadoop
1.Briefly describe how to install and configure open-source Apache Hadoop; a description is enough, but listing the steps is betterhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#1%E7%AE%80%E8%A6%81%E6%8F%8F%E8%BF%B0%E5%A6%82%E4%BD%95%E5%AE%89%E8%A3%85%E9%85%8D%E7%BD%AE%E4%B8%80%E4%B8%AA-apache-%E5%BC%80%E6%BA%90%E7%89%88-hadoop%E6%8F%8F%E8%BF%B0%E5%8D%B3%E5%8F%AF%E5%88%97%E5%87%BA%E6%AD%A5%E9%AA%A4%E6%9B%B4%E5%A5%BD
2.List the processes that must be running in a working Hadoop cluster, and what each one does.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#2%E8%AF%B7%E5%88%97%E5%87%BA%E6%AD%A3%E5%B8%B8%E5%B7%A5%E4%BD%9C%E7%9A%84-hadoop-%E9%9B%86%E7%BE%A4%E4%B8%AD-hadoop-%E9%83%BD%E9%9C%80%E8%A6%81%E5%90%AF%E5%8A%A8%E5%93%AA%E4%BA%9B%E8%BF%9B%E7%A8%8B%E4%BB%96%E4%BB%AC%E7%9A%84%E4%BD%9C%E7%94%A8%E5%88%86%E5%88%AB%E6%98%AF%E4%BB%80%E4%B9%88
3.Starting Hadoop reports the following error; how do you resolve it?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#3%E5%90%AF%E5%8A%A8-hadoop-%E6%8A%A5%E5%A6%82%E4%B8%8B%E9%94%99%E8%AF%AF%E8%AF%A5%E5%A6%82%E4%BD%95%E8%A7%A3%E5%86%B3
4.List the Hadoop schedulers you know and briefly explain how each works.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#4%E8%AF%B7%E5%88%97%E5%87%BA%E4%BD%A0%E6%89%80%E7%9F%A5%E9%81%93%E7%9A%84-hadoop-%E8%B0%83%E5%BA%A6%E5%99%A8%E5%B9%B6%E7%AE%80%E8%A6%81%E8%AF%B4%E6%98%8E%E5%85%B6%E5%B7%A5%E4%BD%9C%E6%96%B9%E6%B3%95
5.Given the log sample format below, write a MapReduce job that counts the occurrences of each element in the fourth columnhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#5%E5%BD%93%E5%89%8D%E6%97%A5%E5%BF%97%E9%87%87%E6%A0%B7%E6%A0%BC%E5%BC%8F%E4%B8%BA%E5%A6%82%E4%B8%8B%E8%AF%B7%E7%BC%96%E5%86%99-mapreduce-%E8%AE%A1%E7%AE%97%E7%AC%AC%E5%9B%9B%E5%88%97%E6%AF%8F%E4%B8%AA%E5%85%83%E7%B4%A0%E5%87%BA%E7%8E%B0%E7%9A%84%E4%B8%AA%E6%95%B0
6.What ways does Hive have to store metadata, and what are the characteristics of each?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#6hive-%E6%9C%89%E5%93%AA%E4%BA%9B%E6%96%B9%E5%BC%8F%E4%BF%9D%E5%AD%98%E5%85%83%E6%95%B0%E6%8D%AE%E5%90%84%E6%9C%89%E5%93%AA%E4%BA%9B%E7%89%B9%E7%82%B9
7.Briefly describe how Hadoop implements secondary sort.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#7%E8%AF%B7%E7%AE%80%E8%BF%B0-hadoop-%E6%80%8E%E4%B9%88%E6%A0%B7%E5%AE%9E%E7%8E%B0%E4%BA%8C%E7%BA%A7%E6%8E%92%E5%BA%8F
8.Implement binary search without recursionhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#8%E7%94%A8%E9%9D%9E%E9%80%92%E5%BD%92%E6%96%B9%E6%B3%95%E5%AE%9E%E7%8E%B0%E4%BA%8C%E5%88%86%E6%9F%A5%E6%89%BE
9.Briefly describe the roles of combiner and partition in MapReduce.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#9%E8%AF%B7%E7%AE%80%E8%BF%B0-mapreduce-%E4%B8%ADcombinerpartition-%E4%BD%9C%E7%94%A8
10.How HDFS data writes are implementedhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#10hdfs-%E6%95%B0%E6%8D%AE%E5%86%99%E5%85%A5%E5%AE%9E%E7%8E%B0%E6%9C%BA%E5%88%B6
11.Rough procedure for dynamically commissioning and decommissioning Hadoop nodeshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#11hadoop-%E8%8A%82%E7%82%B9%E7%9A%84%E5%8A%A8%E6%80%81%E4%B8%8A%E7%BA%BF%E4%B8%8B%E7%BA%BF%E7%9A%84%E5%A4%A7%E6%A6%82%E6%93%8D%E4%BD%9C
12.What determines MapTask parallelism?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#12maptask-%E5%B9%B6%E8%A1%8C%E6%9C%BA%E5%88%B6%E6%98%AF%E7%94%B1%E4%BB%80%E4%B9%88%E5%86%B3%E5%AE%9A%E7%9A%84
13.What is MR for?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#13mr-%E6%98%AF%E5%B9%B2%E4%BB%80%E4%B9%88%E7%9A%84
14.The roles of combiner and partition:https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#14combiner-%E5%92%8C-partition-%E7%9A%84%E4%BD%9C%E7%94%A8
15.What is shufflehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#15%E4%BB%80%E4%B9%88%E6%98%AF-shuffle
16.List several Hadoop ecosystem components and describe each brieflyhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#16%E5%88%97%E4%B8%BE%E5%87%A0%E4%B8%AA-hadoop-%E7%94%9F%E6%80%81%E5%9C%88%E7%9A%84%E7%BB%84%E4%BB%B6%E5%B9%B6%E5%81%9A%E7%AE%80%E8%A6%81%E6%8F%8F%E8%BF%B0
17.What is NameNode Safemode? How do you exit safemode?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#17namenode-%E7%9A%84-safemode-%E6%98%AF%E6%80%8E%E4%B9%88%E5%9B%9E%E4%BA%8B-%E5%A6%82%E4%BD%95%E6%89%8D%E8%83%BD%E9%80%80%E5%87%BA-safemode
18.What is the main responsibility of the SecondaryNameNode? Briefly describe how it workshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#18secondarynamenode-%E7%9A%84%E4%B8%BB%E8%A6%81%E8%81%8C%E8%B4%A3%E6%98%AF%E4%BB%80%E4%B9%88%E7%AE%80%E8%BF%B0%E5%85%B6%E5%B7%A5%E4%BD%9C%E6%9C%BA%E5%88%B6
19.A datanode has crashed; how do you recover it? Briefly describe the recovery process. (Operations)https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#19%E4%B8%80%E4%B8%AA-datanode-%E5%AE%95%E6%9C%BA%E6%80%8E%E4%B9%88%E6%81%A2%E5%A4%8D%E7%AE%80%E5%8D%95%E8%AF%B4%E4%B8%80%E4%B8%8B%E6%81%A2%E5%A4%8D%E6%B5%81%E7%A8%8B%E8%BF%90%E7%BB%B4
20.The Hadoop namenode has crashed; how do you resolve it? (Operations)https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#20hadoop-%E7%9A%84-namenode-%E5%AE%95%E6%9C%BA%E6%80%8E%E4%B9%88%E8%A7%A3%E5%86%B3%E8%BF%90%E7%BB%B4
21.Briefly describe Hadoop installation. (Operations)https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#21%E7%AE%80%E8%BF%B0-hadoop-%E5%AE%89%E8%A3%85%E8%BF%90%E7%BB%B4
22.Which configuration files does Hadoop need, and what does each do? (Operations)https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#22hadoop-%E4%B8%AD%E9%9C%80%E8%A6%81%E5%93%AA%E4%BA%9B%E9%85%8D%E7%BD%AE%E6%96%87%E4%BB%B6%E5%85%B6%E4%BD%9C%E7%94%A8%E6%98%AF%E4%BB%80%E4%B9%88%E8%BF%90%E7%BB%B4
23. List the processes that must be started for Hadoop to work normally, and describe what each doeshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#23-%E8%AF%B7%E5%88%97%E5%87%BA-hadoop-%E6%AD%A3%E5%B8%B8%E5%B7%A5%E4%BD%9C%E6%97%B6%E8%A6%81%E5%90%AF%E5%8A%A8%E5%93%AA%E4%BA%9B%E8%BF%9B%E7%A8%8B%E5%B9%B6%E5%86%99%E5%87%BA%E5%90%84%E8%87%AA%E7%9A%84%E4%BD%9C%E7%94%A8
Reference linkshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hadoop.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#hive
1.Differences between Hive and traditional databaseshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#1hive%E4%B8%8E%E4%BC%A0%E7%BB%9F%E6%95%B0%E6%8D%AE%E5%BA%93%E7%9A%84%E5%8C%BA%E5%88%AB
2.Differences between Hive internal (managed) tables and external tableshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#2hive%E5%86%85%E9%83%A8%E8%A1%A8%E5%92%8C%E5%A4%96%E9%83%A8%E8%A1%A8%E7%9A%84%E5%8C%BA%E5%88%AB
3.Differences between order by, sort by, distribute by, and cluster by in Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#3hive%E4%B8%ADorder-bysort-bydistribute-by%E5%92%8Ccluster-by%E7%9A%84%E5%8C%BA%E5%88%AB
4.Differences between row_number(), rank(), and dense_rank()https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#4row_numberrank%E5%92%8Cdense_rank%E7%9A%84%E5%8C%BA%E5%88%AB
5.What are the commonly used built-in functions in Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#5hive%E4%B8%AD%E5%B8%B8%E7%94%A8%E7%9A%84%E7%B3%BB%E7%BB%9F%E5%87%BD%E6%95%B0%E6%9C%89%E5%93%AA%E4%BA%9B
6.How Hive implements partitioninghttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#6hive%E5%A6%82%E4%BD%95%E5%AE%9E%E7%8E%B0%E5%88%86%E5%8C%BA
7.Five ways to import data into Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#7hive%E5%AF%BC%E5%85%A5%E6%95%B0%E6%8D%AE%E7%9A%84%E4%BA%94%E7%A7%8D%E6%96%B9%E5%BC%8F
8.Five ways to export data from Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#8hive%E5%AF%BC%E5%87%BA%E6%95%B0%E6%8D%AE%E7%9A%84%E4%BA%94%E7%A7%8D%E6%96%B9%E5%BC%8F
9.How do you resolve data skew in Hive join queries?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#9hive-%E8%A1%A8%E5%85%B3%E8%81%94%E6%9F%A5%E8%AF%A2%E5%A6%82%E4%BD%95%E8%A7%A3%E5%86%B3%E6%95%B0%E6%8D%AE%E5%80%BE%E6%96%9C%E7%9A%84%E9%97%AE%E9%A2%98
10.Describe how to use the split, coalesce, and collect_list functions in Hive (examples welcome).https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#10%E5%86%99%E5%87%BAhive-%E4%B8%ADsplitcoalesce-%E5%8F%8Acollect_list-%E5%87%BD%E6%95%B0%E7%9A%84%E7%94%A8%E6%B3%95%E5%8F%AF%E4%B8%BE%E4%BE%8B
11.What ways does Hive have to store metadata, and what are the characteristics of each?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#11hive-%E6%9C%89%E5%93%AA%E4%BA%9B%E6%96%B9%E5%BC%8F%E4%BF%9D%E5%AD%98%E5%85%83%E6%95%B0%E6%8D%AE%E5%90%84%E6%9C%89%E5%93%AA%E4%BA%9B%E7%89%B9%E7%82%B9
12.How is Hive's HQL converted to MapReduce?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#12hive-%E7%9A%84hsql-%E8%BD%AC%E6%8D%A2%E4%B8%BAmapreduce-%E7%9A%84%E8%BF%87%E7%A8%8B
13.In what order should large and small tables be placed in a Hive join?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#13hive-join-%E8%BF%87%E7%A8%8B%E4%B8%AD%E5%A4%A7%E8%A1%A8%E5%B0%8F%E8%A1%A8%E7%9A%84%E6%94%BE%E7%BD%AE%E9%A1%BA%E5%BA%8F
14.How would you implement a join of two Hive tables with MapReduce?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#14hive-%E7%9A%84%E4%B8%A4%E5%BC%A0%E8%A1%A8%E5%85%B3%E8%81%94%E4%BD%BF%E7%94%A8mapreduce-%E6%80%8E%E4%B9%88%E5%AE%9E%E7%8E%B0
15.Do all Hive tasks run MapReduce?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#15%E6%89%80%E6%9C%89%E7%9A%84hive-%E4%BB%BB%E5%8A%A1%E9%83%BD%E4%BC%9A%E6%9C%89mapreduce-%E7%9A%84%E6%89%A7%E8%A1%8C%E5%90%97
16.Hive functions: what are the differences between UDF, UDAF, and UDTF?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#16hive-%E7%9A%84%E5%87%BD%E6%95%B0udfudafudtf-%E7%9A%84%E5%8C%BA%E5%88%AB
17.Explain your understanding of Hive bucketed tables.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#17%E8%AF%B4%E8%AF%B4%E5%AF%B9hive-%E6%A1%B6%E8%A1%A8%E7%9A%84%E7%90%86%E8%A7%A3
18.What is the process for writing a custom Hive UDF?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#18hive-%E8%87%AA%E5%AE%9A%E4%B9%89udf-%E5%87%BD%E6%95%B0%E7%9A%84%E6%B5%81%E7%A8%8B
19.Describe Hive's basic architecturehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#19%E8%AF%B4%E4%B8%8Bhive%E7%9A%84%E5%9F%BA%E6%9C%AC%E6%9E%B6%E6%9E%84
20.Differences between partitioning and bucketing in Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#20hive%E5%88%86%E5%8C%BA%E5%92%8C%E5%88%86%E6%A1%B6%E7%9A%84%E5%8C%BA%E5%88%AB
21.Hive's execution flowhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#21hive%E7%9A%84%E6%89%A7%E8%A1%8C%E6%B5%81%E7%A8%8B
Reference materialhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Hive.md#%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#impala
1.What is Impalahttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#1impala%E6%98%AF%E4%BB%80%E4%B9%88
2.Impala's featureshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#2impala%E7%89%B9%E7%82%B9
3.Impala's drawbackshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#3impala%E7%9A%84%E7%BC%BA%E7%82%B9
4.Describe Impala's core componentshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#4%E8%AF%B4%E4%B8%8Bimpala%E7%9A%84%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6
5.Do you understand Impala's overall architecture and flow?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#5impala%E7%9A%84%E6%95%B4%E4%BD%93%E6%9E%B6%E6%9E%84%E6%B5%81%E7%A8%8B%E4%BA%86%E8%A7%A3%E5%90%97
6.Do you understand the similarities and differences between Impala and Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#6impala%E4%B8%8Ehive%E7%9A%84%E5%BC%82%E5%90%8C%E4%BA%86%E8%A7%A3%E5%90%97
Reference materialhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Impala.md#%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#oozie
1.What is Ooziehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#1oozie-%E6%98%AF%E4%BB%80%E4%B9%88
2.Three main conceptshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#2%E4%B8%89%E4%B8%AA%E4%B8%BB%E8%A6%81%E6%A6%82%E5%BF%B5
3.Workflowhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#3workflow
4.Coordinatorhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#4coordinator
5.Bundlehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#5bundle
6.Relationships among Oozie's componentshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#6oozie%E5%90%84%E4%B8%AA%E7%BB%84%E4%BB%B6%E4%B9%8B%E9%97%B4%E7%9A%84%E5%85%B3%E7%B3%BB
7.Node typeshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#7%E8%8A%82%E7%82%B9%E7%B1%BB%E5%9E%8B
8. Control flow nodeshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#8--%E6%B5%81%E7%A8%8B%E6%8E%A7%E5%88%B6%E8%8A%82%E7%82%B9
9.Action nodeshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#9%E5%8A%A8%E4%BD%9C%E8%8A%82%E7%82%B9
10.Oozie CLI command: launch a jobhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#10oozie-cli%E5%91%BD%E4%BB%A4-%E5%90%AF%E5%8A%A8%E4%BB%BB%E5%8A%A1
12.Oozie CLI command: stop a jobhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#12oozie-cli%E5%91%BD%E4%BB%A4-%E5%81%9C%E6%AD%A2%E4%BB%BB%E5%8A%A1
13.Oozie CLI command: submit a jobhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#13oozie-cli%E5%91%BD%E4%BB%A4-%E6%8F%90%E4%BA%A4%E4%BB%BB%E5%8A%A1
14.Oozie CLI command: start a jobhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#14oozie-cli%E5%91%BD%E4%BB%A4-%E5%BC%80%E5%A7%8B%E4%BB%BB%E5%8A%A1
15.Oozie CLI command: check job execution statushttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#15oozie-cli%E5%91%BD%E4%BB%A4-%E6%9F%A5%E7%9C%8B%E4%BB%BB%E5%8A%A1%E6%89%A7%E8%A1%8C%E6%83%85%E5%86%B5
Reference linkshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Oozie.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#presto
1.What is Prestohttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#1%E4%BB%80%E4%B9%88%E6%98%AFpresto
2.Presto's advantageshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#2presto%E4%BC%98%E5%8A%BF
3.Presto query speed and scalehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#3presto%E6%9F%A5%E8%AF%A2%E9%80%9F%E5%BA%A6%E8%A7%84%E6%A8%A1
4.Presto data modelhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#4presto%E6%95%B0%E6%8D%AE%E6%A8%A1%E5%9E%8B
5.Presto architecturehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#5presto%E6%9E%B6%E6%9E%84
6.Presto access methodshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#6presto-%E6%8E%A5%E5%85%A5%E6%96%B9%E5%BC%8F
7.Presto's drawbackshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#7preto%E7%BC%BA%E7%82%B9
8.Coordinatorhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#8coordinator
9.Workerhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#9worker
10.Connectorhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#10connector
11.Cataloghttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#11catalog
12.Schemahttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#12schema
13.Tablehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#13table
14.Statementhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#14statement
15.Queryhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#15query
16.Stagehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#16stage
17.Taskhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#17task
18.Splithttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#18split
19.Driverhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#19driver
20. Operatorhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#20-operator
21. Exchangehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#21-exchange
Reference linkshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Presto.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#spark
1.How many deployment modes does Spark have, and what are the characteristics of each?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#1spark%E6%9C%89%E5%87%A0%E7%A7%8D%E9%83%A8%E7%BD%B2%E6%A8%A1%E5%BC%8F%E6%AF%8F%E7%A7%8D%E6%A8%A1%E5%BC%8F%E7%9A%84%E7%89%B9%E7%82%B9
2.What components make up the Spark stack, what does each do, and what use cases does each suit?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#2spark%E6%8A%80%E6%9C%AF%E6%A0%88%E6%9C%89%E5%93%AA%E4%BA%9B%E7%BB%84%E4%BB%B6%E6%AF%8F%E4%B8%AA%E7%BB%84%E4%BB%B6%E9%83%BD%E6%9C%89%E4%BB%80%E4%B9%88%E5%8A%9F%E8%83%BD%E9%80%82%E5%90%88%E4%BB%80%E4%B9%88%E5%BA%94%E7%94%A8%E5%9C%BA%E6%99%AF
3.What components does Spark havehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#3spark%E6%9C%89%E5%93%AA%E4%BA%9B%E7%BB%84%E4%BB%B6
4.How Spark workshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#4spark%E5%B7%A5%E4%BD%9C%E6%9C%BA%E5%88%B6
5.Execution process of a Spark applicationhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#5spark%E5%BA%94%E7%94%A8%E7%A8%8B%E5%BA%8F%E7%9A%84%E6%89%A7%E8%A1%8C%E8%BF%87%E7%A8%8B
6.What does the driver do?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#6driver%E7%9A%84%E5%8A%9F%E8%83%BD%E6%98%AF%E4%BB%80%E4%B9%88
7.What is the main job of a Worker in Spark?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#7spark%E4%B8%ADworker%E7%9A%84%E4%B8%BB%E8%A6%81%E5%B7%A5%E4%BD%9C%E6%98%AF%E4%BB%80%E4%B9%88
8.How many types of task are there?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#8task%E6%9C%89%E5%87%A0%E7%A7%8D%E7%B1%BB%E5%9E%8B
9.What is shuffle, and why is it needed?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#9%E4%BB%80%E4%B9%88%E6%98%AFshuffle%E4%BB%A5%E5%8F%8A%E4%B8%BA%E4%BB%80%E4%B9%88%E9%9C%80%E8%A6%81shuffle
10.Why does a Spark master HA failover not affect jobs already running on the cluster?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#10spark-master-ha-%E4%B8%BB%E4%BB%8E%E5%88%87%E6%8D%A2%E8%BF%87%E7%A8%8B%E4%B8%8D%E4%BC%9A%E5%BD%B1%E5%93%8D%E9%9B%86%E7%BE%A4%E5%B7%B2%E6%9C%89%E7%9A%84%E4%BD%9C%E4%B8%9A%E8%BF%90%E8%A1%8C%E4%B8%BA%E4%BB%80%E4%B9%88
11.How should Spark parallelism be sethttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#11spark%E5%B9%B6%E8%A1%8C%E5%BA%A6%E6%80%8E%E4%B9%88%E8%AE%BE%E7%BD%AE%E6%AF%94%E8%BE%83%E5%90%88%E9%80%82
12.Why does a Spark program sometimes produce many tasks by default, and how do you change the default number of tasks?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#12spark%E7%A8%8B%E5%BA%8F%E6%89%A7%E8%A1%8C%E6%9C%89%E6%97%B6%E5%80%99%E9%BB%98%E8%AE%A4%E4%B8%BA%E4%BB%80%E4%B9%88%E4%BC%9A%E4%BA%A7%E7%94%9F%E5%BE%88%E5%A4%9Atask%E6%80%8E%E4%B9%88%E4%BF%AE%E6%94%B9%E9%BB%98%E8%AE%A4task%E6%89%A7%E8%A1%8C%E4%B8%AA%E6%95%B0
13.Who manages data location in Spark?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#13spark%E4%B8%AD%E6%95%B0%E6%8D%AE%E7%9A%84%E4%BD%8D%E7%BD%AE%E6%98%AF%E8%A2%AB%E8%B0%81%E7%AE%A1%E7%90%86%E7%9A%84
14.Why is serialization neededhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#14%E4%B8%BA%E4%BB%80%E4%B9%88%E8%A6%81%E8%BF%9B%E8%A1%8C%E5%BA%8F%E5%88%97%E5%8C%96
15.How does Spark handle objects that cannot be serialized?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#15spark%E5%A6%82%E4%BD%95%E5%A4%84%E7%90%86%E4%B8%8D%E8%83%BD%E8%A2%AB%E5%BA%8F%E5%88%97%E5%8C%96%E7%9A%84%E5%AF%B9%E8%B1%A1
16.What command do you use to submit your jar to Spark?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#16spark%E6%8F%90%E4%BA%A4%E4%BD%A0%E7%9A%84jar%E5%8C%85%E6%97%B6%E6%89%80%E7%94%A8%E7%9A%84%E5%91%BD%E4%BB%A4%E6%98%AF%E4%BB%80%E4%B9%88
17.Similarities and differences between MapReduce and Sparkhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#17mapreduce%E5%92%8Cspark%E7%9A%84%E7%9B%B8%E5%90%8C%E5%92%8C%E5%8C%BA%E5%88%AB
18.Briefly describe the similarities and differences between Hadoop's and Spark's shuffle.https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#18%E7%AE%80%E5%8D%95%E8%AF%B4%E4%B8%80%E4%B8%8Bhadoop%E5%92%8Cspark%E7%9A%84shuffle%E7%9B%B8%E5%90%8C%E5%92%8C%E5%B7%AE%E5%BC%82
19. Briefly describe the shuffle process in Hadoop and Sparkhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#19-%E7%AE%80%E5%8D%95%E8%AF%B4%E4%B8%80%E4%B8%8Bhadoop%E5%92%8Cspark%E7%9A%84shuffle%E8%BF%87%E7%A8%8B
20.Relationship between partition and blockhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#20partition%E5%92%8Cblock%E7%9A%84%E5%85%B3%E8%81%94
21.Why is Spark faster than MapReduce?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#21spark%E4%B8%BA%E4%BB%80%E4%B9%88%E6%AF%94mapreduce%E5%BF%AB
22.Which Spark operators correspond to the mapper and reducer stages of a MapReduce job?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#22mapreduce%E6%93%8D%E4%BD%9C%E7%9A%84mapper%E5%92%8Creducer%E9%98%B6%E6%AE%B5%E7%9B%B8%E5%BD%93%E4%BA%8Espark%E4%B8%AD%E7%9A%84%E5%93%AA%E5%87%A0%E4%B8%AA%E7%AE%97%E5%AD%90
23.RDD mechanismhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#23rdd%E6%9C%BA%E5%88%B6
24.In what ways is an RDD resilient?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#24rdd%E7%9A%84%E5%BC%B9%E6%80%A7%E8%A1%A8%E7%8E%B0%E5%9C%A8%E5%93%AA%E5%87%A0%E7%82%B9
25.What are the shortcomings of RDDs?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#25rdd%E6%9C%89%E5%93%AA%E4%BA%9B%E7%BC%BA%E9%99%B7
26.What are wide and narrow RDD dependencies?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#26%E4%BB%80%E4%B9%88%E6%98%AFrdd%E5%AE%BD%E4%BE%9D%E8%B5%96%E5%92%8C%E7%AA%84%E4%BE%9D%E8%B5%96
27.How many types of RDD operations are there?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#27rdd%E6%9C%89%E5%87%A0%E7%A7%8D%E6%93%8D%E4%BD%9C%E7%B1%BB%E5%9E%8B
28.What are the characteristics of Spark accumulators?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#28spark%E7%B4%AF%E5%8A%A0%E5%99%A8%E6%9C%89%E5%93%AA%E4%BA%9B%E7%89%B9%E7%82%B9
29.Drawbacks of Spark's HashPartitionerhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#29spark-hashparitioner%E7%9A%84%E5%BC%8A%E7%AB%AF
30.How RangePartitioner partitioning workshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#30rangepartitioner%E5%88%86%E5%8C%BA%E7%9A%84%E5%8E%9F%E7%90%86
Reference materialhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Spark.md#%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#sqoop
1.For Sqoop, knowing how to use it is enough on the jobhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#1sqoop-%E5%9C%A8%E5%B7%A5%E4%BD%9C%E4%B8%AD%E7%9A%84%E5%AE%9A%E4%BD%8D%E6%98%AF%E4%BC%9A%E7%94%A8%E5%B0%B1%E8%A1%8C
2.Parameters for Sqoop imports into Hivehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#2sqoop%E5%AF%BC%E5%85%A5hive%E6%97%B6%E7%9A%84%E5%8F%82%E6%95%B0
3.How do you import incremental data from an RDBMS?https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#3rdbms%E4%B8%AD%E7%9A%84%E5%A2%9E%E9%87%8F%E6%95%B0%E6%8D%AE%E5%A6%82%E4%BD%95%E5%AF%BC%E5%85%A5
4.Null storage consistency in Sqoop import and exporthttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#4sqoop%E5%AF%BC%E5%85%A5%E5%AF%BC%E5%87%BAnull%E5%AD%98%E5%82%A8%E4%B8%80%E8%87%B4%E6%80%A7%E9%97%AE%E9%A2%98
5.Consistency in Sqoop data exporthttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#5sqoop%E6%95%B0%E6%8D%AE%E5%AF%BC%E5%87%BA%E4%B8%80%E8%87%B4%E6%80%A7%E9%97%AE%E9%A2%98
6.What kind of job does Sqoop run under the hoodhttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#6sqoop%E5%BA%95%E5%B1%82%E8%BF%90%E8%A1%8C%E7%9A%84%E4%BB%BB%E5%8A%A1%E6%98%AF%E4%BB%80%E4%B9%88
7.Issues when map task parallelism is set above 1https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#7map-task%E5%B9%B6%E8%A1%8C%E5%BA%A6%E8%AE%BE%E7%BD%AE%E5%A4%A7%E4%BA%8E1%E7%9A%84%E9%97%AE%E9%A2%98
8.How long does a single Sqoop data export run takehttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#8sqoop%E6%95%B0%E6%8D%AE%E5%AF%BC%E5%87%BA%E7%9A%84%E6%97%B6%E5%80%99%E4%B8%80%E6%AC%A1%E6%89%A7%E8%A1%8C%E5%A4%9A%E9%95%BF%E6%97%B6%E9%97%B4
9.Things to note when importing data into HDFS with Sqoophttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#9sqoop-%E5%AF%BC%E5%85%A5%E6%95%B0%E6%8D%AE%E5%88%B0hdfs%E6%B3%A8%E6%84%8F%E4%BA%8B%E9%A1%B9
10.Pros and cons of Sqoop1 and Sqoop2:https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#10sqoop1%E5%92%8Csqoop2%E4%BC%98%E7%BC%BA%E7%82%B9
Reference linkshttps://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Sqoop.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#storm
1. What is Storm? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#1-%E4%BB%80%E4%B9%88%E6%98%AF-storm
2. Increasing parallelism https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#2%E6%8F%90%E9%AB%98%E5%B9%B6%E5%8F%91%E5%BA%A6
3. What happens when the Nimbus or Supervisor daemon dies? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#3%E5%BD%93-nimbus-%E6%88%96-supervisor-%E5%AE%88%E6%8A%A4%E8%BF%9B%E7%A8%8B%E6%AD%BB%E4%BA%A1%E6%97%B6%E4%BC%9A%E5%8F%91%E7%94%9F%E4%BB%80%E4%B9%88
4. Is Nimbus a single point of failure? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#4nimbus-%E6%98%AF%E5%8D%95%E7%82%B9%E6%95%85%E9%9A%9C%E5%90%97
5. How does Storm guarantee data processing? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#5storm-%E5%A6%82%E4%BD%95%E4%BF%9D%E8%AF%81%E6%95%B0%E6%8D%AE%E5%A4%84%E7%90%86
6. How is Storm's reliability implemented, covering both the spout and the bolt? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#6storm-%E7%9A%84%E5%8F%AF%E9%9D%A0%E6%80%A7%E5%A6%82%E4%BD%95%E5%AE%9E%E7%8E%B0%E5%8C%85%E6%8B%AC-spout-%E5%92%8C-bolt-%E4%B8%A4%E9%83%A8%E5%88%86
7. What are Storm's grouping strategies? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#7storm-%E5%88%86%E7%BB%84%E7%AD%96%E7%95%A5%E6%96%B9%E5%BC%8F
8. What is Storm's physical architecture? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#8storm-%E7%9A%84%E7%89%A9%E7%90%86%E6%9E%B6%E6%9E%84
9. Why Storm achieves real-time low latency https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#9storm-%E5%AE%9E%E6%97%B6%E4%BD%8E%E5%BB%B6%E8%BF%9F%E7%9A%84%E5%8E%9F%E5%9B%A0
10. What is offline computing? What is stream computing? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#10%E7%A6%BB%E7%BA%BF%E8%AE%A1%E7%AE%97%E6%98%AF%E4%BB%80%E4%B9%88%E6%B5%81%E5%BC%8F%E8%AE%A1%E7%AE%97%E6%98%AF%E4%BB%80%E4%B9%88
11. Differences between Storm and Hadoop https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#11storm-%E4%B8%8E-hadoop-%E7%9A%84%E5%8C%BA%E5%88%AB
12. Storm core components https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#12storm-%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6
13. What happens when a worker dies? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#13%E5%BD%93%E4%B8%80%E4%B8%AA-worker-%E6%8C%82%E6%8E%89%E6%97%B6%E4%BC%9A%E5%8F%91%E7%94%9F%E4%BB%80%E4%B9%88
14. What happens when a node dies? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#14-%E5%BD%93%E4%B8%80%E4%B8%AA-node%E8%8A%82%E7%82%B9%E6%8C%82%E6%8E%89%E6%97%B6%E4%BC%9A%E5%8F%91%E7%94%9F%E4%BB%80%E4%B9%88
15. What happens when the Nimbus or Supervisor daemon dies? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#15%E5%BD%93-nimbus-%E6%88%96-supervisor-%E5%AE%88%E6%8A%A4%E8%BF%9B%E7%A8%8B%E6%AD%BB%E4%BA%A1%E6%97%B6%E4%BC%9A%E5%8F%91%E7%94%9F%E4%BB%80%E4%B9%88
16. What are the stream modes? What is the default? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#16%E6%B5%81%E7%9A%84%E6%A8%A1%E5%BC%8F%E6%98%AF%E4%BB%80%E4%B9%88%E9%BB%98%E8%AE%A4%E6%98%AF%E4%BB%80%E4%B9%88
17. Storm grouping categories https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#17storm-group-%E5%88%86%E7%B1%BB
18. What are Storm's characteristics and features? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#18storm-%E7%9A%84%E7%89%B9%E7%82%B9%E5%92%8C%E7%89%B9%E6%80%A7%E6%98%AF%E4%BB%80%E4%B9%88
19. What is the Storm programming model? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#19storm-%E7%BC%96%E7%A8%8B%E6%A8%A1%E5%9E%8B
20. How do Spark Streaming and Storm differ? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#20spark-streaming-%E5%92%8C-storm-%E6%9C%89%E4%BB%80%E4%B9%88%E4%B8%8D%E5%90%8C
Reference links https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Storm.md#%E5%8F%82%E8%80%83%E9%93%BE%E6%8E%A5
https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#yarn
1. Briefly introduce Yarn https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#1%E7%AE%80%E5%8D%95%E4%BB%8B%E7%BB%8D-yarn
2. What are Yarn's characteristics? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#2yarn-%E6%9C%89%E4%BB%80%E4%B9%88%E7%89%B9%E7%82%B9
3. Why use Yarn? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#3%E4%B8%BA%E4%BB%80%E4%B9%88%E8%A6%81%E4%BD%BF%E7%94%A8-yarn
4. Yarn's main role https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#4yarn-%E4%B8%BB%E8%A6%81%E4%BD%9C%E7%94%A8
5. Yarn's structure https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#5yarn-%E7%9A%84%E7%BB%93%E6%9E%84
6. What is responsible for allocating resources to applications while Yarn is running? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#6yarn-%E5%9C%A8%E8%BF%90%E8%A1%8C%E8%BF%87%E7%A8%8B%E4%B8%AD%E8%B4%9F%E8%B4%A3%E7%BB%99%E5%BA%94%E7%94%A8%E5%88%86%E9%85%8D%E8%B5%84%E6%BA%90%E7%9A%84%E6%98%AF%E4%BB%80%E4%B9%88
7. Yarn's workflow https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#7yarn-%E7%9A%84%E5%B7%A5%E4%BD%9C%E6%B5%81%E7%A8%8B
8. Yarn's schedulers https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#8yarn-%E7%9A%84%E8%B0%83%E5%BA%A6%E5%99%A8
9. YARN high availability https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#9yarn-%E9%AB%98%E5%8F%AF%E7%94%A8
10. What is a container? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#10%E4%BB%80%E4%B9%88%E6%98%AF-container
11. Which schedulers and hardware resource types does Yarn support? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#11yarn%E6%94%AF%E6%8C%81%E7%9A%84%E8%B0%83%E5%BA%A6%E5%99%A8%E5%92%8C%E7%A1%AC%E4%BB%B6%E8%B5%84%E6%BA%90%E7%A7%8D%E7%B1%BB
12. Is it correct to say that there are Containers on the RM node? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#12%E8%AF%B7%E9%97%AErm%E8%8A%82%E7%82%B9%E4%B8%8A%E6%9C%89container%E5%AE%B9%E5%99%A8%E7%9A%84%E8%BF%99%E7%A7%8D%E8%AF%B4%E6%B3%95%E5%90%97
13. In the AM, the job has already been split into a series of tasks, and startContainer is called for each task. Why does the NM need to store an application data structure? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#13%E5%9C%A8am%E4%B8%ADjob%E5%B7%B2%E7%BB%8F%E8%A2%AB%E5%88%86%E6%88%90%E4%B8%80%E7%B3%BB%E5%88%97%E7%9A%84task%E5%B9%B6%E4%B8%94%E6%98%AF%E4%B8%BA%E6%AF%8F%E4%B8%AAtask%E6%9D%A5startcontainer%E4%B8%BA%E4%BB%80%E4%B9%88nm%E4%B8%8A%E8%A6%81%E5%AD%98%E4%B8%80%E4%B8%AAapplication%E7%9A%84%E6%95%B0%E6%8D%AE%E7%BB%93%E6%9E%84%E5%91%A2
14. Does only the NodeManager that launches the ApplicationMaster maintain an Application object? Do the other NodeManagers simply launch this Application's other Containers in response to requests from the ApplicationMaster, without needing to maintain the Application's state machine? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#14%E6%98%AF%E5%90%A6%E5%8F%AA%E6%9C%89%E8%B4%9F%E8%B4%A3%E5%90%AF%E5%8A%A8applicationmaster%E7%9A%84nodemanager%E6%89%8D%E4%BC%9A%E7%BB%B4%E6%8A%A4%E4%B8%80%E4%B8%AAapplication%E5%AF%B9%E8%B1%A1%E5%85%B6%E4%BB%96%E7%9A%84nodemanager%E6%98%AF%E5%90%A6%E6%98%AF%E6%A0%B9%E6%8D%AEapplicationmaster%E5%8F%91%E8%B5%B7%E7%9A%84%E8%AF%B7%E6%B1%82%E6%9D%A5%E5%90%AF%E5%8A%A8%E5%B1%9E%E4%BA%8E%E8%BF%99%E4%B8%AAapplication%E7%9A%84%E5%85%B6%E4%BB%96container%E8%BF%99%E4%BA%9Bnodemanager%E4%B8%8D%E9%9C%80%E8%A6%81%E7%BB%B4%E6%8A%A4application%E7%9A%84%E7%8A%B6%E6%80%81%E6%9C%BA
15. Node randomness of Containers? https://github.com/BigDataInterviewHub/BigDataInterview/blob/main/Yarn.md#15container%E7%9A%84%E8%8A%82%E7%82%B9%E9%9A%8F%E6%9C%BA%E6%80%A7
Readme https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview#readme-ov-file
Activity https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/activity
4 stars https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/stargazers
1 watching https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/watchers
2 forks https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/forks
Releases https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/releases
Packages 0 https://patch-diff.githubusercontent.com/users/BigDataInterviewHub/packages?repo_name=BigDataInterview
Contributors 2 https://patch-diff.githubusercontent.com/BigDataInterviewHub/BigDataInterview/graphs/contributors

Viewport: width=device-width


URLs of crawlers that visited me.