Apache Hadoop

Apache Hadoop is a freely distributed set of utilities and libraries, together with a framework, for developing and running distributed programs on clusters of hundreds or thousands of nodes.
It is used to implement search and contextual mechanisms on many high-load websites, including Yahoo! and Facebook.
It is written in Java and built around the MapReduce computing paradigm, in which an application is split into a large number of identical elementary tasks that run on the cluster nodes and whose outputs combine naturally into the final result.
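The MapReduce idea described above can be illustrated with a minimal single-process sketch. It does not use Hadoop or any of its APIs; the function names (`map_task`, `shuffle`, `reduce_task`, `run_job`) and the word-count task are assumptions chosen for illustration only. Each input chunk stands in for the data one cluster node would process.

```python
from collections import defaultdict
from itertools import chain


def map_task(chunk):
    """Map step: emit a (word, 1) pair for every word in one input chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group emitted values by key, the step a framework performs between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_task(key, values):
    """Reduce step: fold all values for one key into a single result."""
    return key, sum(values)


def run_job(chunks):
    """Run identical map tasks over every chunk, then combine the outputs."""
    mapped = chain.from_iterable(map_task(chunk) for chunk in chunks)
    grouped = shuffle(mapped)
    return dict(reduce_task(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical input: each string plays the role of one node's data split.
    print(run_job(["hadoop stores data", "hadoop processes data"]))
```

In a real cluster the map tasks would run in parallel on different nodes and the grouping step would move data across the network; here everything runs sequentially in one process to keep the data flow visible.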
Packt Publishing, 2018. — 220 p. — ASIN B07K46H6VV. Code files only. A fast-paced guide that will help you learn about Apache Hadoop 3 and its ecosystem. Key features: set up, configure, and get started with Hadoop to get useful insights from large data sets; work with the different components of Hadoop such as MapReduce, HDFS, and YARN; learn about the new features introduced in...
  • No. 1
  • 1015.45 KB
Manning Publications, 2021. — 482 p. — ISBN 978-1617296901. Code Files Only! Data Pipelines with Apache Airflow teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start...
  • No. 2
  • 219.47 KB
Packt, 2018. — 482 p. Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3. Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the...
  • No. 3
  • 947.14 KB
Packt Publishing, 2019. — 531 p. — ISBN: 1788620445. Code files only. Apache Hadoop is one of the most popular big data solutions for distributed storage and for processing large chunks of data. With Hadoop 3, Apache promises to provide a high-performance, more fault-tolerant, and highly efficient big data processing platform, with a focus on improved scalability and increased...
  • No. 4
  • 247.13 KB
Packt Publishing, 2016. — 979 p. — ISBN: 978-1-78712-516-2. Unlock the power of your data with the Hadoop 2.X ecosystem and its data warehousing techniques across large data sets. As Marc Andreessen has said, “Data is eating the world,” which can be witnessed today being the age of Big Data, businesses are producing data in huge volumes every day and this rise in tide of data need to...
  • No. 5
  • 472.99 KB
Packt Publishing, 2017. — 206 p. — ISBN: 9781787124769. This book will teach you how to deploy large-scale datasets in deep neural networks with Hadoop for optimal performance. Starting with understanding what deep learning is, and what the various models associated with deep neural networks are, this book will then show you how to set up the Hadoop environment for deep...
  • No. 6
  • 29.74 KB