Reproducible data:
import pandas as pd
import numpy as np
cols1 = ['b', 'a', 'c', 'a']
data1 = [0, 0, 0, 0]
df1 = pd.DataFrame([data1], columns=cols1)
df1
cols2 = ['b', 'a', 'd', 'a', 'e', 'f']
data2 = [1, 1, 1, 1, 1, 1]
df2 = pd.DataFrame([data2], columns=cols2)
df2
The result I want:
# np.nan is used here because the np.NaN alias was removed in NumPy 2.0
data = {"b": [0, 1],
        "b a": [0, 1],
        "c": [0, np.nan],
        "c a": [0, np.nan],
        "d": [np.nan, 1],
        "d a": [np.nan, 1],
        "e": [np.nan, 1],
        "f": [np.nan, 1]}
pd.DataFrame(data)
Both DataFrames contain the duplicate column name "a", so I cannot combine them with pd.concat.
Is there a good way to deal with duplicate column names?
Whenever an "a" column follows another column such as "b", I want to rename that "a" to "b a" (and likewise "c a" and "d a").
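
One possible approach, as a minimal sketch: walk the column labels in order, remember the most recent label that is not "a", and prefix each "a" with it before concatenating. The helper name `prefix_duplicates` and its `dup` parameter are hypothetical (they are not part of pandas), and the sketch assumes every "a" column has some other column to its left.

```python
import pandas as pd


def prefix_duplicates(df, dup="a"):
    """Return a copy of df where each column named `dup` is renamed to
    '<preceding non-dup column> <dup>'. Hypothetical helper, not a pandas API."""
    new_cols = []
    prev = None  # most recent column label that is not `dup`
    for col in df.columns:
        if col == dup and prev is not None:
            new_cols.append(f"{prev} {dup}")
        else:
            new_cols.append(col)
            prev = col
    out = df.copy()
    out.columns = new_cols
    return out


df1 = pd.DataFrame([[0, 0, 0, 0]], columns=["b", "a", "c", "a"])
df2 = pd.DataFrame([[1, 1, 1, 1, 1, 1]], columns=["b", "a", "d", "a", "e", "f"])

# After renaming, the labels are unique within each frame, so concat can align them.
result = pd.concat(
    [prefix_duplicates(df1), prefix_duplicates(df2)],
    ignore_index=True,
)
print(result)
```

Columns that exist in only one of the frames are filled with NaN in the other row, so those columns end up with a float dtype.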