OK, I just experienced the same problem, with the same symptom: df[column][n] changed type after n > 32767.
I did indeed have a problem in my data, but not at all at line 32767.
Finding and fixing the few problematic lines solved my problem.
I managed to locate the problematic lines with the following extremely dirty routine:
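import pandas as pd

# Read the file in chunks of 10000 rows; the dtype is inferred separately for each chunk
chunks = pd.read_csv('data.csv', chunksize=10000)
for i, chunk in enumerate(chunks):
    # Print the chunk index and the dtype inferred for the suspect column
    print("{} {}".format(i, chunk["Custom Dimension 02"].dtype))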
I ran this and obtained:
0 int64
1 int64
2 int64
3 int64
4 int64
5 int64
6 object
7 int64
8 object
9 int64
10 int64
This told me that there was at least one problematic line between rows 60000 and 69999, and another between rows 80000 and 89999.
To locate them more precisely, you can use a smaller chunksize and print only the row ranges whose data type is not the expected one.
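A minimal sketch of that refinement, assuming the column should be int64 throughout. The helper name find_bad_chunks, the column name "value" and the small in-memory CSV are made up for illustration; with a real file you would pass its path and your own column name. Row numbers are 0-based data rows, not counting the header line.

```python
import io

import pandas as pd


def find_bad_chunks(source, column, expected="int64", chunksize=1000):
    """Return (first_row, last_row, dtype) for every chunk whose dtype is not `expected`."""
    bad = []
    reader = pd.read_csv(source, chunksize=chunksize)
    for i, chunk in enumerate(reader):
        dtype = str(chunk[column].dtype)
        if dtype != expected:
            start = i * chunksize
            # The last chunk may be shorter than chunksize, so use len(chunk)
            bad.append((start, start + len(chunk) - 1, dtype))
    return bad


# Hypothetical demo data: ten integer rows, one of them corrupted
rows = [str(n) for n in range(10)]
rows[7] = "oops"
demo = io.StringIO("value\n" + "\n".join(rows) + "\n")

bad_chunks = find_bad_chunks(demo, "value", chunksize=2)
for first, last, dtype in bad_chunks:
    print("rows {}-{}: {}".format(first, last, dtype))
```

Shrinking chunksize narrows the range each time; with chunksize=1 every reported range is a single offending row, at the cost of a much slower pass over the file.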
df = pd.read_csv("data.csv", skiprows=32768) is the dtype wrong?
skiprows=[32768]. You skipped the first 32768 rows without the [].
skiprows=[32768], I still have df["CallGuid"][32767] as long and df["CallGuid"][32768] as unicode.
df['CallGuid'] = df['CallGuid'].astype(int64)