I have a number of txt files in a directory that I'd like to combine.The following is an example of three files named df_A, df_B and df_c:
df_A
0 1 2
0 James 1 yes
1 Jake 3 No
2 Jane 2 Yes
df_B
0 1 2
0 Jane 2 No
1 Job 6 No
2 James 1 Yes
df_C
0 1 2
0 Jack 4 No
1 Jenny 7 Yes
2 James 1 No
3 John 9 Yes
AndI'd like the final dataframe to look like this:
ID Name df_A df_B df_C
1 James Yes Yes No
3 Jake No NA NA
2 Jane Yes No NA
6 Job NA Yes NA
4 Jack NA NA No
7 Jenny NA NA Yes
9 John NA NA Yes
this is the code I have thus far...
new_df = pd.DataFrame(columns = ['Name', 'ID'])
for filename in os.listdir('/path'):
if filename.endswith('.txt'):
course = os.path.splitext(filename)[0]
new_df = pd.concat([combined_df,pd.DataFrame(columns=[course])])
data = pd.read_csv(filename, sep="\t", header=None)
for i in data[data.columns[1]]:
if i not in new_df['ID']:
new_df['ID'].append(i)