Skip to content

DekarLab

Blog about big data processing and data-driven investments

  • DATA PROCESSING
  • INVESTMENTS
  • MONEY BUILDER
  • INDEX TRACKING (ETF)
  • PDI/R PLUGIN
  • BOOKS
  • MISC
  • Impressum
DekarLab

Tag: hadoop

Authentication in Hadoop cluster: MIT Kerberos and Active Directory

23. May 2020 karden DATA PROCESSING

There are different options how to activate kerberos in Hadoop cluster.

Read more
[Total: 0   Average: 0/5]
Post Views: 208

Kerberos: overview

22. May 2020 karden DATA PROCESSING

Kerberos authentication protocol is needed to secure Hadoop cluster. This is the only way to make Hadoop cluster secure.

Read more
[Total: 0   Average: 0/5]
Post Views: 155

Authentication and authorization in Hadoop cluster

6. February 2020 karden DATA PROCESSING

Here we explain concepts behind activation of security in Hadoop cluster.

Read more
[Total: 0   Average: 0/5]
Post Views: 194

HBase is next step in your big data technology stack

10. September 2017 karden DATA PROCESSING

Read more

[Total: 0   Average: 0/5]

Post Views:
277

Improving performance by reading data with Hive for HDFS using subfolders (partitioning)

6. June 2017 karden DATA PROCESSING

In ourĀ  previous article we have discussed the root structure for HDFS. In this article we will discuss next level of the file structure, which will help to improve the speed of reading data.

Read more

[Total: 1   Average: 5/5]

Post Views:
284

Short note about HDFS or why you need distributed file system

21. May 2017 karden DATA PROCESSING

Why do you need HDFS (Hadoop Distributed Files System)? If the amount of data is small and place on your computer is enough for this, then you do not need distributed file system. But if you like to process a large amount of data, which is not possible to save on one computer, then you need to think about distributed file system.

Read more

[Total: 0   Average: 0/5]

Post Views:
255

Tags

all (50) article (3) auth (3) book (5) code on github (5) data lake (12) design (18) gui xmdm (1) hadoop (6) hbase (1) hive (1) index tracking (4) informatica (1) k8s (5) kafka (1) kylin (3) microservices (6) mondrian (1) money builder (5) OLAP in hadoop (3) pentaho (3) phd thesis (1) spark (4) zeppelin (2)
WordPress Theme: Poseidon by ThemeZee.