Next-generation big data: a practical guide to Apache Kudu, Impala, and Spark

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data....

Bibliographic details
Main author:	Quinto, Butch (Author)
Format:	Electronic E-book
Language:	English
Published:	[Place of publication not identified] Apress [2018]
Subjects:	Spark (Electronic resource : Apache Software Foundation) Big data Data mining Electronic data processing > Management Data Mining Données volumineuses Exploration de données (Informatique) COMPUTERS ; Data Processing Electronic data processing ; Management
Online access:	License required

MARC


LEADER	00000cam a22000002 4500
001	ZDB-30-ORH-047606819
003	DE-627-1
005	20240228120528.0
007	cr uuu---uuuuu
008	191023s2018 xx \|\|\|\|\|o 00\| \|\|eng c
035			\|a (DE-627-1)047606819
035			\|a (DE-599)KEP047606819
035			\|a (ORHE)9781484231470
035			\|a (DE-627-1)047606819
040			\|a DE-627 \|b ger \|c DE-627 \|e rda
041			\|a eng
082	0		\|a 005.7 \|2 23
100	1		\|a Quinto, Butch \|e VerfasserIn \|4 aut
245	1	0	\|a Next-generation big data \|b a practical guide to Apache Kudu, Impala, and Spark \|c Butch Quinto
264		1	\|a [Place of publication not identified] \|b Apress \|c [2018]
264		4	\|c ©2018
300			\|a 1 online resource (1 volume) \|b illustrations
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Includes bibliographical references. - Online resource; title from cover (Safari, viewed July 9, 2018)
520			\|a Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used for big data warehousing, data warehouse optimization, real-time and batch data ingestion and processing, real-time data visualization, big data governance, data wrangling, big data cloud deployments, and distributed in-memory big data computing. Finally, the book has an extensive and detailed coverage of big data case studies from Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard. What You'll Learn: Install Apache Kudu, Impala, and Spark to modernize enterprise data warehouse and business intelligence environments, complete with real-world, easy-to-follow examples, and practical advice Integrate HBase, Solr, Oracle, SQL Server, MySQL, Flume, Kafka, HDFS, and Amazon S3 with Apache Kudu, Impala, and Spark Use StreamSets, Talend, Pentaho, and CDAP for real-time and batch data ingestion and processing Utilize Trifacta, Alteryx, and Datameer for data wrangling and interactive data processing Turbocharge Spark with Alluxio, a distributed in-memory storage platform Deploy big data in the cloud using Cloudera Director Perform real-time data visualization and time series analysis using Zoomdata, Apache Kudu, Impala, and Spark Understand enterprise big data topics such as big data governance, metadata management, data lineage, impact analysis, and policy enforcement, and how to use Cloudera Navigator to perform common data governance tasks Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling Study real-world big data case studies from innovative companies, including Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard.
630	2	0	\|a Spark (Electronic resource : Apache Software Foundation)
650		0	\|a Big data
650		0	\|a Data mining
650		0	\|a Electronic data processing \|x Management
650		2	\|a Data Mining
650		4	\|a Spark (Electronic resource : Apache Software Foundation)
650		4	\|a Données volumineuses
650		4	\|a Exploration de données (Informatique)
650		4	\|a COMPUTERS ; Data Processing
650		4	\|a Big data
650		4	\|a Data mining
650		4	\|a Electronic data processing ; Management
856	4	0	\|l TUM01 \|p ZDB-30-ORH \|q TUM_PDA_ORH \|u https://learning.oreilly.com/library/view/-/9781484231470/?ar \|m X:ORHE \|x Aggregator \|z lizenzpflichtig \|3 Volltext
912			\|a ZDB-30-ORH
912			\|a ZDB-30-ORH
951			\|a BO
912			\|a ZDB-30-ORH
049			\|a DE-91

Record in the search index

DE-BY-TUM_katkey	ZDB-30-ORH-047606819
_version_	1818767308082905088
adam_text
any_adam_object
author	Quinto, Butch
author_facet	Quinto, Butch
author_role	aut
author_sort	Quinto, Butch
author_variant	b q bq
building	Verbundindex
bvnumber	localTUM
collection	ZDB-30-ORH
ctrlnum	(DE-627-1)047606819 (DE-599)KEP047606819 (ORHE)9781484231470
dewey-full	005.7
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	005 - Computer programming, programs, data, security
dewey-raw	005.7
dewey-search	005.7
dewey-sort	15.7
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Electronic eBook
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03855cam a22004812 4500</leader><controlfield tag="001">ZDB-30-ORH-047606819</controlfield><controlfield tag="003">DE-627-1</controlfield><controlfield tag="005">20240228120528.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">191023s2018 xx \|\|\|\|\|o 00\| \|\|eng c</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)047606819</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP047606819</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ORHE)9781484231470</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)047606819</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005.7</subfield><subfield code="2">23</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Quinto, Butch</subfield><subfield code="e">VerfasserIn</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Next-generation big data</subfield><subfield code="b">a practical guide to Apache Kudu, Impala, and Spark</subfield><subfield code="c">Butch Quinto</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">[Place of publication not identified]</subfield><subfield code="b">Apress</subfield><subfield code="c">[2018]</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">©2018</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (1 
volume)</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references. - Online resource; title from cover (Safari, viewed July 9, 2018)</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used for big data warehousing, data warehouse optimization, real-time and batch data ingestion and processing, real-time data visualization, big data governance, data wrangling, big data cloud deployments, and distributed in-memory big data computing. Finally, the book has an extensive and detailed coverage of big data case studies from Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard. 
What You'll Learn: Install Apache Kudu, Impala, and Spark to modernize enterprise data warehouse and business intelligence environments, complete with real-world, easy-to-follow examples, and practical advice Integrate HBase, Solr, Oracle, SQL Server, MySQL, Flume, Kafka, HDFS, and Amazon S3 with Apache Kudu, Impala, and Spark Use StreamSets, Talend, Pentaho, and CDAP for real-time and batch data ingestion and processing Utilize Trifacta, Alteryx, and Datameer for data wrangling and interactive data processing Turbocharge Spark with Alluxio, a distributed in-memory storage platform Deploy big data in the cloud using Cloudera Director Perform real-time data visualization and time series analysis using Zoomdata, Apache Kudu, Impala, and Spark Understand enterprise big data topics such as big data governance, metadata management, data lineage, impact analysis, and policy enforcement, and how to use Cloudera Navigator to perform common data governance tasks Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling Study real-world big data case studies from innovative companies, including Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard.</subfield></datafield><datafield tag="630" ind1="2" ind2="0"><subfield code="a">Spark (Electronic resource : Apache Software Foundation)</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Big data</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Electronic data processing</subfield><subfield code="x">Management</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Data Mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Spark (Electronic resource : 
Apache Software Foundation)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Données volumineuses</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Exploration de données (Informatique)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">COMPUTERS ; Data Processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Big data</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Electronic data processing ; Management</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">TUM01</subfield><subfield code="p">ZDB-30-ORH</subfield><subfield code="q">TUM_PDA_ORH</subfield><subfield code="u">https://learning.oreilly.com/library/view/-/9781484231470/?ar</subfield><subfield code="m">X:ORHE</subfield><subfield code="x">Aggregator</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">BO</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield></datafield></record></collection>
id	ZDB-30-ORH-047606819
illustrated	Illustrated
indexdate	2024-12-18T08:47:49Z
institution	BVB
language	English
open_access_boolean
owner	DE-91 DE-BY-TUM
owner_facet	DE-91 DE-BY-TUM
physical	1 online resource (1 volume) illustrations
psigel	ZDB-30-ORH
publishDate	2018
publishDateSearch	2018
publishDateSort	2018
publisher	Apress
record_format	marc
spelling	Quinto, Butch VerfasserIn aut Next-generation big data a practical guide to Apache Kudu, Impala, and Spark Butch Quinto [Place of publication not identified] Apress [2018] ©2018 1 online resource (1 volume) illustrations Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Includes bibliographical references. - Online resource; title from cover (Safari, viewed July 9, 2018) Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used for big data warehousing, data warehouse optimization, real-time and batch data ingestion and processing, real-time data visualization, big data governance, data wrangling, big data cloud deployments, and distributed in-memory big data computing. Finally, the book has an extensive and detailed coverage of big data case studies from Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard. 
What You'll Learn: Install Apache Kudu, Impala, and Spark to modernize enterprise data warehouse and business intelligence environments, complete with real-world, easy-to-follow examples, and practical advice Integrate HBase, Solr, Oracle, SQL Server, MySQL, Flume, Kafka, HDFS, and Amazon S3 with Apache Kudu, Impala, and Spark Use StreamSets, Talend, Pentaho, and CDAP for real-time and batch data ingestion and processing Utilize Trifacta, Alteryx, and Datameer for data wrangling and interactive data processing Turbocharge Spark with Alluxio, a distributed in-memory storage platform Deploy big data in the cloud using Cloudera Director Perform real-time data visualization and time series analysis using Zoomdata, Apache Kudu, Impala, and Spark Understand enterprise big data topics such as big data governance, metadata management, data lineage, impact analysis, and policy enforcement, and how to use Cloudera Navigator to perform common data governance tasks Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling Study real-world big data case studies from innovative companies, including Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard. Spark (Electronic resource : Apache Software Foundation) Big data Data mining Electronic data processing Management Data Mining Données volumineuses Exploration de données (Informatique) COMPUTERS ; Data Processing Electronic data processing ; Management TUM01 ZDB-30-ORH TUM_PDA_ORH https://learning.oreilly.com/library/view/-/9781484231470/?ar X:ORHE Aggregator lizenzpflichtig Volltext
spellingShingle	Quinto, Butch Next-generation big data a practical guide to Apache Kudu, Impala, and Spark Spark (Electronic resource : Apache Software Foundation) Big data Data mining Electronic data processing Management Data Mining Données volumineuses Exploration de données (Informatique) COMPUTERS ; Data Processing Electronic data processing ; Management
title	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark
title_auth	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark
title_exact_search	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark
title_full	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark Butch Quinto
title_fullStr	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark Butch Quinto
title_full_unstemmed	Next-generation big data a practical guide to Apache Kudu, Impala, and Spark Butch Quinto
title_short	Next-generation big data
title_sort	next generation big data a practical guide to apache kudu impala and spark
title_sub	a practical guide to Apache Kudu, Impala, and Spark
topic	Spark (Electronic resource : Apache Software Foundation) Big data Data mining Electronic data processing Management Data Mining Données volumineuses Exploration de données (Informatique) COMPUTERS ; Data Processing Electronic data processing ; Management
topic_facet	Spark (Electronic resource : Apache Software Foundation) Big data Data mining Electronic data processing Management Data Mining Données volumineuses Exploration de données (Informatique) COMPUTERS ; Data Processing Electronic data processing ; Management
url	https://learning.oreilly.com/library/view/-/9781484231470/?ar
work_keys_str_mv	AT quintobutch nextgenerationbigdataapracticalguidetoapachekuduimpalaandspark
