Python + Excel - deriving a correlated mean from 2 data groups?

What I'm doing: I have two columns in Excel, price and borough (British for district). I generated the overall data set with Python too, so I already have a list of the various boroughs. I want a breakdown of the mean price for each borough in that list. So far, I've written a program that returns the mean of the entire numerical column, which is the price:

    import pandas as pd

    # Path to the workbook; defined once and reused below
    file = "/Users/my_name/Documents/Startup Ideas/Python Data /file.xlsx"
    df = pd.ExcelFile(file).parse("Sheet1")

    # Mean of the whole "Price" column, ignoring borough
    mean_value = df["Price"].mean()

The borough list is:

["Chelsea", "Kensington", "Westminster", "Pimlico", "Bank", "Holborn", "Camden", "Islington", "Angel", "Battersea", "Knightsbridge", "Bermondsey", "Newham"]

How would I bring in the other column, i.e. the borough from the list above, and return the mean price per borough? Thanks very much in advance.

I really don't know where to start.

This is the source code that generates the input data:

    import random
    import numpy as np
    import pandas as pd

    SIZE = 70_000
    BOROUGHS = ["Chelsea", "Kensington", "Westminster", "Pimlico", "Bank", "Holborn", "Camden", "Islington", "Angel", "Battersea", "Knightsbridge", "Bermondsey", "Newham"]

    # Seeds NumPy only; random.choice below draws from Python's own generator
    np.random.seed(1)
    data3 = pd.DataFrame({
        "Sq. feet": np.random.randint(low=75, high=325, size=SIZE),
        "Price": np.random.randint(low=200000, high=1250000, size=SIZE),
        "Borough": [random.choice(BOROUGHS) for _ in range(SIZE)],
    })

edited Dec 7, 2023 at 0:14

furas

asked Dec 6, 2023 at 20:56

Zarathustra

maybe you need df.groupby('Borough').mean()

– furas, commented Dec 7, 2023 at 0:15
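
To illustrate the groupby suggestion in the comment above, here is a minimal sketch. It assumes the sheet has "Price" and "Borough" columns as in the question; the five-row DataFrame, the short `boroughs` list, and the variable names are hypothetical stand-ins for the real data. Selecting `["Price"]` before calling `.mean()` limits the result to the price column, and `reindex` is one possible way to line the result up with an explicit borough list.

```python
import pandas as pd

# Hypothetical sample standing in for the "Price"/"Borough" columns of the sheet
df = pd.DataFrame({
    "Borough": ["Chelsea", "Camden", "Chelsea", "Camden", "Angel"],
    "Price": [900_000, 400_000, 1_100_000, 600_000, 500_000],
})

# Mean price per borough: a Series indexed by borough name
mean_by_borough = df.groupby("Borough")["Price"].mean()

# Order the result by an explicit borough list;
# boroughs with no rows in the data come back as NaN
boroughs = ["Chelsea", "Camden", "Angel", "Newham"]
mean_for_list = mean_by_borough.reindex(boroughs)

print(mean_for_list)
```

With the real workbook, `df` would be the frame parsed from the Excel file and `boroughs` the full list from the question.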

See also: "Python Pandas How to assign groupby operation results back to columns in parent dataframe?" (Stack Overflow)
– furas, commented Dec 7, 2023 at 0:17
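
The linked question is about writing group results back onto the original rows. If that is what "add another column" means here, a sketch along these lines may help; it uses `transform("mean")`, which returns one value per original row. The sample data and the column name "Borough mean price" are hypothetical, chosen only for illustration.

```python
import pandas as pd

# Hypothetical sample standing in for the "Price"/"Borough" columns of the sheet
df = pd.DataFrame({
    "Borough": ["Chelsea", "Camden", "Chelsea", "Camden", "Angel"],
    "Price": [900_000, 400_000, 1_100_000, 600_000, 500_000],
})

# transform("mean") keeps the original row order and index,
# so the result can be assigned straight back as a new column
df["Borough mean price"] = df.groupby("Borough")["Price"].transform("mean")

print(df)
```

The plain `.mean()` form gives one row per borough, whereas `transform` repeats each borough's mean on every row belonging to it, which suits writing the frame back to Excel alongside the original prices.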
