DATAANALYTICS

In this project, I conducted an analysis of one day's sales data for the Farfetch company in the USA. To perform the analysis, I utilized Python Pandas for data extraction and analysis. Additionally, I used Tableau and Pandas plot to visualize the results
The project began with the initial step of adapting the provided data in CSV format for further analysis. This involved refining the spreadsheet by removing columns with redundant information. As the analysis was conducted in an improvisational manner, I chose to explore specific fields of interest.
First, I examined the most popular product categories and identified the 10-30 most indemand brands and subcategories. Furthermore, I investigated the ratio between discounted and non-discounted products, as well as the categories that experienced the highest proportion of discounted sales.
It is important to note that I did not conduct comparative analyses across different dates. As such, it is plausible that the observed significant differences in certain categories could be attributed to it being the first day of sales of the particular categories.
Overall, this analysis provides valuable insights into the sales patterns and dynamics of Farfetch in the USA on a single day. These findings can assist in making informed business decisions and understanding customer preferences
Amount of unique brands 3400
Currency: USD, EUR
Date: June 23, 2023
Destination: US
Total Amount of sold items: 588.825
Most expensive item: USD 1 407 796.68 pre-owned fine watches
From Audemars Piguet
Smallest price: pencil sharpener from makeup category - USD 6.43
1 day sales turnover in local currency (USD)465 992 100.54
<class pandas core frame DataFrame>
RangeIndex: 588825 entries, 0 to 588824
Data columns (total 14 columns):
# Column Non-Null Count Dtype
0
DATA
1
2
3
7
8
One of the most popularcategoriesDresses, and 10 most popular brends were sorted out
Day Dresses analysis, top 10 brands
PERCENTAGE OF FULL PRICE ITEMS AND DISCOUNTED ITEMS
30 MOST POPULAR SHOE BRANDS, TOTAL SALES IN USD
discounted
ITEMS SOLD MOSTLY WITH DISCOUNTS, DISCOUNTS MIGHT HAVE BEEN THE PRIMARY DRIVER FOR SALES IN THESE CATEGORIES
SHOES AND ACTIVEWEAR WEREN'T AFFECTED MUCH WITH DISCOUNTS
30 european countries The oldest company established in Spain in 1898
There are 66 componies with less than or equal to 5 employees
39 Busines sectors
According to the next table, it is possible to get a short presentation about the data provided
"Sectorial concentration across countries shows that countries with the largest income specialize or cover the majority of sectors. The technology sector is the most popular
plt.figure(figsize=(8,4))
sns histplot(data=UK,x='Sector',pal ette='Pastel5')
plt.xticks(rotation=90)
plt.title('employees Distribution in UK',fontsize = 20) plt show()
The same analysis was made via Tableau in a more attractive and simple way
Based on the data obtained, it is also possible to conduct a comparative analysis of the distribution of employees across sectors
Amount changes of companies ranked in 2020 and 2017
Next, I decided to investigate the UK market separately because it is no longer part of the European Union
To eliminate UK from the list of European Countries possible with the next command: data without uk = EUC[EUC['Country'] != 'UK'
Interesting facts: out of the total number of companies, 66 of them have 5 or fewer employees in their staff, with 10 of those companies among the first 100."
Italy has the biggest concentration of companies
Based on these two graphs, it can be concluded that positive changes are observed in all sectors in a more or less systematic manner However, sectors such as construction and financial services have switched positions, with the financial services sector surpassing construction in terms of revenue by the year 2020. Similarly, the transportation sector, which was previously ranked fifth in popularity, has declined to the tenth position
The main goal was to extract the necessary data to determine the best selling items in the bakery, as well as to determine the most popular days and times of the day among visitors. The use of such data will allow to optimize working hours, introduce some additional services and tailor the production of certain products only on the best-selling days, thereby reducing costs and emissions.
I prefer to use Python Pandas for extracting the necessary data and for being able to present the information graphically through additional libraries like seaborn and matplotlib In this case, I created a column "sold per minute" , and to simplify further research, it was decided to order the time based on an interval of 60 minutes, which made it possible to find out the most popular visiting hours of the bakery
In the Excel table, I made some improvements to replace the names of the days of the week, which are numbered from 0 to 6 by default in Python But it was also necessary to review and adapt all the data for further presentation in the scoreboard
In Tableau I made research to determine the best-selling group of bakery products (top 20), as well as the definition of the most visited days and times of the day In fact, the data obtained is a repetition of what was done in Pandas Python, but it took less time
The most popular weekdays
The busiest and freest hours
In this project, I conducted an analysis of one day's sales data for the Farfetch company in the USA. To perform the analysis, I utilized Python Pandas for data extraction and analysis. Additionally, I used Tableau and Pandas plot to visualize the results
The project began with the initial step of adapting the provided data in CSV format for further analysis. This involved refining the spreadsheet by removing columns with redundant information. As the analysis was conducted in an improvisational manner, I chose to explore specific fields of interest.
First, I examined the most popular product categories and identified the 10-30 most indemand brands and subcategories. Furthermore, I investigated the ratio between discounted and non-discounted products, as well as the categories that experienced the highest proportion of discounted sales.
It is important to note that I did not conduct comparative analyses across different dates. As such, it is plausible that the observed significant differences in certain categories could be attributed to it being the first day of sales of the particular categories.
Overall, this analysis provides valuable insights into the sales patterns and dynamics of Farfetch in the USA on a single day. These findings can assist in making informed business decisions and understanding customer preferences