![]() |
Excel is a powerful tool that Data Analysts use to transform raw data into actionable insights, aiding decision-making and business success. Analysts play a detective-like role, identifying patterns and interpreting trends to convey complex data in a straightforward manner. In this tutorial, we will develop a project aimed at analysing Indian Cities Electricity Consumption using Excel providing guidance on how to effectively gather, process, analyse, and present the relevant data to derive actionable insights for the stakeholders. Table of Content The Six Steps of Data Analysis in ExcelIn Data Analysis, there are 6 steps in total:
Excel Project for Data Analysis: Indian Cities Electricity Consumption 2017-19Let’s demonstrate the data analysis process using dataset Indian cities’ electricity consumption (2017-2019), This dataset contains records of electricity consumption for various cities across India over the years 2017 to 2019 with multiple rows and columns representing different aspects of electricity usage. Usage of the Dataset:
Analysing this dataset can be beneficial to make informed decisions based on consumption patterns and trends. Step 1: Define Problem StatementFor the phase 1: The stakeholders want some details:
Step 2: Load and Prepare the DatasetAfter downloading, we will upload the dataset on Microsoft Excel. ![]() Dataset uploaded in Excel For more refer to : How to Import, Edit, Load and Consolidate Data in Excel Power Query? After uploading the data, it is necessary to check whether the data is clean or not. Step 3: Data Processing : CleaningData integrity refers to the quality of data being accurate, complete, consistent, and trustworthy throughout its entire lifecycle, ensuring its reliability and suitability for analysis. Cleaning data, in this context, signifies that the data possesses these attributes before undergoing analysis. For Data Cleaning, we will use below tools and techniques:
Within this project, we aim to address the presence of missing values, specifically identifying columns containing “NA” values through the application of conditional formatting. This approach enables the visual highlighting of such occurrences for further scrutiny. Additionally, we will employ the COUNTIF function to quantitatively assess the extent of missing values within the dataset. By leveraging these methods, we can systematically identify and evaluate the presence of missing data, facilitating the subsequent steps of data cleaning and analysis. After applying conditional formatting and COUNTIF, the application is shown below. ![]() Conditional Formatting To count the number of cells that matched a specified value, we will use the formula: =COUNTIF(C2:H48, “NA”) ![]() COUNTIF Example We will remove the “NA” values because it makes the dataset incomplete. While in big datasets, we can remove these values by deleting the complete row or Interpolate Missing Values. But since our dataset is not very large, it is better to change the “NA” values to 0 as the result will be negligible. For which we will use “Find and select” option. ![]() Find And Select Click Find and Select and select Replace. ![]() Replace ‘”NA” by ‘0’ Click Replace all. ![]() All the “NA” values are converted to 0 Hence, data is cleaned.
Step 4: Analysing The datasetAfter data cleaning, dataset can be analysed. 4. 1 To know the value of Consumption of Electricity (in lakh units) for Commercial purpose for Indore City. For this, we will use VLOOKUP function, It searches for a certain value in a column to return a corresponding piece of information. The formula for VLOOKUP is given below: =VLOOKUP(lookup value, range containing the lookup value, the column number in the range containing the return value, Approximate match (TRUE) or Exact match (FALSE)). ![]() Example: Vlookup function (Indore City electricity consumption – Commercial purpose) 4.2 To determine the aggregate electricity consumption across Indian cities. We will employ the SUM function for data aggregation. This function allows us to calculate the total consumption by summing up the values within the specified range, which in this case is from cell H2 to H48 using the formula: =SUM(H2:H48) ![]() Using Sum function to find the total consumption by all the cities Additionally, the Pivot Table tool can serve as an alternative method for aggregating and summarizing data. By utilizing Pivot Tables, we can efficiently analyze and present the total electricity consumption across various cities in a structured and customizable format. It’s a powerful tool to calculate, summarize and analyze data that lets you see comparisons, patterns, and trends in your data. For moving on to next problem statement, we will be using: Exploring Data with PivotTables in Excel To accomplish the tasks in the project, we will leverage the functionality of PivotTables within Excel, facilitating a structured and comprehensive analysis of the provided dataset. Through this approach, we aim to derive detailed insights into the electricity consumption patterns of Indian cities, enabling informed decision-making by the stakeholders. 4.3 Now, next major problem, which city consumes maximum and minimum amount of electricity for Industry Purpose? Steps to find the value:
![]() Pivot Table Options
![]() Value Field Settings
![]() Click OK ![]() Pivot Example 2
4.4 Total electricity consumption on per year basis. Steps to find the value:
![]() Pivot Table to calculate maximum and minimum electricity consumption per year 4.5 If there is any trend(increase/decrease) for the electricity consumption? Let’s search for varanasi city, steps to find the trend of Varanasi City for the year 2016-17 and 2017-18:
![]() Selecting Varanasi City
![]() Selecting Year
![]() Difference in electricity consumption trend Step 5: Visualization: Display the findings with correct type of chartsEmploying graphical representations such as charts, we can effectively illustrate and present the conclusions drawn from our analysis. Utilizing charts enables us to visually depict various aspects of the data, including trends, comparisons, and distributions, thereby enhancing the clarity and comprehensibility of our findings. Bar Chart for Total Electricity Consumption Yearly ![]() Data Visualization Charts using Column
![]() Data Visualization using Line Chart The graph shows that total electricity consumption for industrial purposes varies across the cities measured. The highest electricity consumption is around 140,00 kWh and the lowest is around 258 kWh. Pie Chart for Total Electricity Consumption- Public Water and Street Light ![]() Data Visualization using Pie Chart
Step 6: Draw Conclusions and Employ Strategic decisionsIn the Act phase, stakeholders are presented with the opportunity to enact strategic decisions based on the insights derived from the analysis of electricity consumption patterns in Indian cities. These decisions may include:
ConclusionData Analysts uses Excel to find some in-depth insights from the data which can blow everyone’s mind and use such insights to make the system more efficient. |
Reffered: https://www.geeksforgeeks.org
AI ML DS |
Type: | Geek |
Category: | Coding |
Sub Category: | Tutorial |
Uploaded by: | Admin |
Views: | 13 |