# hadoop-tutorials

**Repository Path**: mirrors_mehikmat/hadoop-tutorials

## Basic Information

- **Project Name**: hadoop-tutorials
- **Description**: hadoop-tutorials
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-08-09
- **Last Updated**: 2025-12-20

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

## Hadoop Tutorials

This repo contains a set of tutorials for Hadoop designed to work inside or alongside the [Hortonworks Sandbox](http://hortonworks.com/products/hortonworks-sandbox/). The Sandbox is a single-node Hadoop cluster that runs in a virtual machine (VirtualBox, VMware, Hyper-V).

![image](sandbox.png?raw=true)

Please feel free to recommend edits to existing tutorials and to contribute your own. New contributions should be placed in the '[Community](/community)' folder.

Current tutorials are as follows:

### Sandbox Tutorials (Embedded in Sandbox and tested with v2.0)

* [1 - Hello World - Using Sandbox with Hive and Pig](/Sandbox/T01_Hello_World_Using_Sandbox_with_Hive_and_Pig.md)
* [2 - Data Processing with Pig - Processing Baseball Stats](/Sandbox/T02_Data_Processing_with_Pig.md)
* [3 - Data Processing with Hive - Processing Baseball Stats](/Sandbox/T03_Data_Processing_with_Hive.md)
* [4 - HCatalog, Basic Pig and Hive Commands](/Sandbox/T04_HCatalog_Basic_Pig_and_Hive.md)
* [5 - Using Basic Pig Commands](/Sandbox/T05_Using_Basic_Pig_Commands.md)
* [6 - Loading Data into the Hadoop](/Sandbox/T06_Loading_Data_into_Sandbox.md)
* [7 - Installing the Hortonworks ODBC driver on Windows 7](/Sandbox/T07_Installing_the_Hortonworks_ODBC_Driver_on_Windows_7.md)
* [8 - Using Excel 2013 to Access Hadoop Data](/Sandbox/T08_Using_Excel_2013_to_Access_Hadoop_data.md)
* [9 - Using Excel 2013 to Analyze Hadoop Data](/Sandbox/T09_Using_Excel_2013_to_Analyze_Hadoop_data.md)
* [10 - Visualizing Website Clickstream Data](/Sandbox/T10_Visualizing_Website_Clickstream_Data.md)
* [11 - Installing the Hortonworks ODBC driver on Mac OS X](/Sandbox/T11_Installing_the_Hortonworks_ODBC_driver_on_Mac_OSX.md)
* [12 - Refining and Visualizing Server Log Data](/Sandbox/T12_Refining_and_Visualizing_Server_Log_Data.md)
* [13 - Refining and Visualizing Sentiment Data](/Sandbox/T13_Refining_and_Visualizing_Sentiment_Data.md)
* [14 - Analyzing Machine and Sensor Data](/Sandbox/T14_Analyzing_Machine_and_Sensor_Data.md)

### Community Tutorials

Feel free to contribute tutorials and help the community harness Hadoop!

* [1 - Using RHadoop to Predict Web Visitors](/Community/T01_RHadoop_visitors_prediction.md) from [mwacc](https://github.com/mwacc)
* [2 - Using Spring XD to stream Tweets to Hadoop for Sentiment Analysis](/Community/T02_Spring_XD_Hadoop_Twitter.md) from [mehzer](https://github.com/mehzer)
* [3 - Word Counting with Pig](/Community/T03_Word_Counting_With_Pig.md) from [flacrosse](https://github.com/flacrosse)
* [4 - Import from Microsoft SQL Server into the Hortonworks Sandbox using Sqoop](/Community/T04_Import_from_Microsoft_SQL_Server_into_the_Hortonworks_Sandbox_using_Sqoop.md) from [flacrosse](https://github.com/flacrosse)