Hadoop: Fundamentals training course is designed to give you the basic overview of the Hadoop framework. The course covers the fundamental concepts to process and analyze large sets of data stored in HDFS. The course also briefly touches the more advanced concepts like Sqoop and Flume for data ingestion. The more details about these advanced concepts is covered under the course Hadoop: Intermediate.
The Hadoop: Fundamentals course is part of a two course series which covers the essential concepts in getting to know Hadoop and the big-data analytics. With increasing digital trend in the world, the importance of big data and data analytics is going to continue growing in the coming years. This course will enable the candidates to explore opportunities in this growing field of digital science.
This course will teach students about Hadoop architecture, ETL, and MapReduce. This course will teach students about Hadoop architecture, ETL, and MapReduce. This course will teach students about Hadoop architecture, ETL, and MapReduce. This course will teach students about Hadoop architecture, ETL, and MapReduce. This course will teach students about Hadoop architecture, ETL, and MapReduce.
Hadoop is becoming an essential tool in the ever-growing Big-Data architecture. This training is designed to benefit:
Hadoop: Intermediate training course is designed to give you in-depth knowledge about the Hadoop framework discussed in our Hadoop and MapReduce Fundamentals course. The course covers the concepts to process and analyze large sets of data stored in HDFS. It teaches Sqoop and Flume for data ingestion.
The Hadoop: Intermediate course is part of a two course series which covers the essential concepts in getting to know Hadoop and the big-data analytics. With increasing digital trend in the world, the importance of big data and data analytics is going to continue growing in the coming years. This course will enable the candidates to explore opportunities in this growing field of digital science.
Hadoop is becoming an essential tool in the ever-growing Big-Data architecture. This training is designed to benefit:
Excel 2013 is a game changer for any Excel user. Excel 2013 provides new technologies and tools for the business user to easily analyze data to create powerful reports and dashboards in a fraction of the time of traditional spreadsheets.
This course is for all business users who want to learn how to create management reports, analyze data and create interactive dashboards using Excel 2013.
In this course we learn the following:
This course will provide you with an easy to follow step by step approach to using the new Business Intelligence technology in Excel 2013.
Create amazing reports and analyze data in minutes with Excel Pivot Tables. This course focuses on creating and understanding how to use Excel Pivot Tables to quickly analyze and understand your business data. Pivot Tables make it easy to compile management reports and generate new insight and intelligence from your data in minutes.
This Excel Business Intelligence training course is for all business users who are required to develop management reports, analyze data or ask ad-hoc questions of their data. McKinsey Consulting in a recent report entitled 'Big data: The next frontier for innovation, competition, and productivity' emphasizes that data skills will become the basis for competition, innovation, growth and productivity. The report also highlights the need for around 1.5 million managers in United States with data skills.
This course has over 25 video lectures that take you step by step through creating management reports and analyzing data with a couple of clicks of the mouse. The course is structured into 9 sections which takes you from creating simple management reports to sophisticated data analysis.
Learn techniques to easily aggregate your data, produce reports, understand trends, filter your data, create interactive reports and use sophisticated calculations.
So begin your journey today with Excel Business Intelligence.
This course is for all Excel users who wish to learn how to create powerful interactive dashboards using Pivot Charts and Pivot Tables. This course will teach you to turn your Excel data into powerful tables, graphs and dashboards.
In this course you will learn to create three different dashboards.
The first dashboard will focus on Sales information creating key metrics, trends, top 10 analysis and a percentage contribution.
The second dashboard will focus on Human Resource data and display key metrics, number of employees by age, number of employees by years of service and gender profile.
The third dashboard focuses on financial information and allows the user to understand the variance of the Profit from previous months and how much it has changed from month to month. In addition techniques and tools such as sparklines, data bars, trendlines and slicers are also included.
This course is very important for all Excel users who are required to develop management reporting or to perform data analysis and wish to gain insight and intelligence from their data.
The course is designed to take you through step by step to create the interactive dashboards using simple drag and drop techniques. This course does not include any custom programming or macros. All the dashboards and techniques can be followed by any Excel user who understands the basic principles of Excel.
An introductory knowledge of Pivot Tables and Pivot Charts will assist you in the course. However you can follow the step by step instructions if you do not have an in-depth knowledge of Pivot Tables and Pivot Charts. Please note that you will require Excel 2010 or 2013 for this course. The course is presented using Excel 2013.
Power Pivot has been described as the most important new feature in Excel in 20 years. Power Pivot has been developed by Microsoft to help companies analyze and understand the growing quantities of data that are created on a daily basis.
Power Pivot will improve your time to create reports, develop data analysis and gain insight into your your data by at least 80%.
Power Pivot is a free Add-in for Excel 2010 and is called the Data Model in Excel 2013 that provides capabilities to handle large quantities of data. PowerPivot is a columnar database that is capable of storing and manipulating hundreds of millions rows of data. PowerPivot brings new capabilities to business users to analyse, explore and gain new insight into business and marketing data. PowerPivot has been referred to as the most important feature for Excel in 20 years.
Power Pivot makes it easy to setup relationships between tables of data in the same way as databases and therefore does not require the use of Excel functions such as Vlookup. PowerPivot therefore decreases the time substantially to analyse and gain new insight from business and marketing data.
Power Pivot allows the rapid development of powerful PivotTables and PivotCharts in Excel utilising the latest Slicer technology to easily analyse and view data from multiple different perspectives. PowerPivot also adds a new Data Analysis Expressions (DAX) formula language that provides new and powerful calculations and insight to the business user.
If you are a Google Analytics user you will know how much data can be provided to you. However it is not always easy to access this data and make sense of it within Google Analytics. In this course Ian Littlejohn will show you how to easily create your own custom Google Data Studio reports and dashboards that will make it easy to understand your Google Analytics data.
Google Data Studio is Google's latest report and dashboard creation tool. Using a drag and drop interface reports can be created and running in minutes. Google Data Studio also makes it incredibly easy to share your reports with other users.
In this course we will show you how to easily create the following:
Connect to the Google Analytics data source
Understand the Google Data Studio interface
Create Scorecards displaying key information such as Sessions and Users
Create Time Series / Trend graphs displaying how Sessions and Users changes over time
How to setup different filters for your data
Display information using column, bar and pie charts
Create effective maps and geographical analysis
Create powerful interactive dashboards
This course is for anybody who wants to effectively analyze and understand their Google Analytics data.
In this short and concise course you will learn everything to get started with web scraping using Java.
You will learn the concepts behind web scraping that you can apply to practically any web page (static AND dynamic / AJAX).
We start with an overview of what web scraping is and what you can do with it.
Then we explain the difference in scraping static pages vs dynamic / AJAX pages. You learn how to classify a website in one of the two categories and then apply the right concept in order to scrape the data you want.
Now you will learn how to export the scraped data either as CSV or JSON. These are some popular formats that can be used for further processing.
Unfortunately many websites try to block scrapers or sometimes you just do not want to be detected. In the section going undercover you will learn how to stay undetected and avoid getting blocked.
At the end of the course you can download the full source code of all the lectures and we discuss an outlook to some advanced topics (private proxies, cloud deployment, multi threading ...). Those advanced topics are covered in a follow up course I am going to teach.
Why you should take this course
Stop imagining you can scrape data from websites and use the skills for your next web project, you can do it now.
Web scraping is the art of picking out data from a website by looking at the HTML code and identifying patterns that can be used to identify your data. This data can then be gathered and later used for your own analysis.
In this course we will go over the basic of web scraping, learning all about how we can extract data from websites, and all of this is guided along by a work example.
At the end of the course you should be able to go off on your own, and pick out most common
APIs are available on most modern websites, and provide an easy way to integrate the websites functionalities into your code. The API that will be focused on is the Twitter API, which which will be used to mine tweets about the event.
Social media, especially twitter, is becoming a hot topic among many investors, as its trends can often predict behavior of the stock market. This course will focus on how Twitter data can be live streamed, and will feature a worked example of the Yahoo hack, that was revealed on December 14th, 2016.
Python programmers are some of the most sought-after employees in the tech world, and Python itself is fast becoming one of the most popular programming languages. One of the best applications of Python however is data analysis; which also happens to be something that employers can't get enough of. Gaining skills in one or the other is a guaranteed way to boost your employability – but put the two together and you'll be unstoppable!
Become and expert data analyser
Python data analytics made Simple
This course contains 51 lectures and 6 hours of content, specially created for those with an interest in data analysis, programming, or the Python programming language. Once you have Python installed and are familiar with the language, you'll be all set to go.
The course begins with covering the fundamentals of Pandas (the library of data structures you'll be using) before delving into the most important functions you'll need for data analysis; creating and navigating data frames, indexing, visualising, and so on. Next, you'll get into the more intricate operations run in conjunction with Pandas including data manipulation, logical categorising, statistical functions and applications, and more. Missing data, combining data, working with databases, and advanced operations like resampling, correlation, mapping and buffering will also be covered.
By the end of this course, you'll have not only have grasped the fundamental concepts of data analysis, but through using Python to analyse and manipulate your data, you'll have gained a highly specific and much in demand skill set that you can put to a variety of practical used for just about any business in the world.
Python: Python is a general purpose programming language with a focus on readability and concise code, making it a great language for new coders to learn. Learning Python gives a solid foundation for learning more advanced coding languages, and allows for a wide variety of applications.
Pandas: Pandas is a free, open source library that provides high-performance, easy to use data structures and data analysis tools for Python; specifically, numerical tables and time series. If your project involves lots of numerical data, Pandas is for you.
NumPy: Like Pandas, NumPy is another library of high level mathematical functions. The difference with NumPy however is that was specifically created as an extension to the Python programming language, intended to support large multi-dimensional arrays and matrices.
Data and analytics are becoming increasingly important in our world and in modern day businesses. To start off with data analytics (and ultimately provide nice images of our results), we need to be able to plot our data, preferably in the way we imagine it in our heads.
Matplotlib provides many great plotting opportunities and methods for data visualization, and in this course we will be looking at some introductory methods for getting started with creating plots in Python.
Once we have a starting point for plotting data we can easily expand our knowledge to different areas to make sure we can best represent all of our data.
Scraping data from webpages can be a tedious job. But it doesn’t have to be.
With Scrapy, you can scrape using XPath or CSS. With the large number of examples from both techniques, you’re sure to find a solution that fits for you.
Whether your targeting data on a single page or multiple, Scrapy can handle the job. No matter if the data is within a list, you can scrape specific patterns right out of the list. Building up your specific Scrapy job isn't a difficult task.
Scrapy is a Python library. If you're familiar with Python, XPath or CSS, you'll feel right at home using Scrapy.
At the end of this course, you will understand:
- what Scrapy is used for
- how to install it
- how to use Scapy
In summary, you'll be able to target specific elements on a webpage, whether the element is stand along or in a list. Then you can retrieve a group of those elements or just one. This technique allows you to pull down specific types of data.
The course ends with a project to help solidify what you've learned. There is a full walk through included with the project solution.
BeautifulSoup is a popular Python library for extracting data from HTML or live pages. It isn't limited to a single webpage. You can extract data from multiple webpages. In fact, one of the examples we use does just that. Knowing how to find data within the HTML tree is key to getting targeted data. This course will show you how to identify that data within the HTML tree. Then you'll build a parsing rule to extract it using BeautifulSoup. With a number of examples to ensure you know exactly how to find data, build parsing rules and the needed code to execute the extract, you'll walk away from this course feeling confident in your abilities to retrieve data from webpages.