I am trying to read all the .txt files with the below format provided and concat them to a single pandas dataframe.
sample1.txt
ID a123
Delivery_person_ID VADRES03DEL01
Delivery_person_Age 24.00
Delivery_person_Ratings 4.30
Name: 1, dtype: object
sample2.txt
ID b123
Delivery_person_ID VADRES03DEL02
Delivery_person_Age 22.00
Delivery_person_Ratings 4.10
Name: 2, dtype: object
Below is the code -
folder_path = '/drive/My Drive/dataset/train'
file_list = glob.glob(folder_path + "/*.txt")
main_dataframe = pd.read_fwf(file_list[0], header=None)
for i in range(1,len(file_list)):
df = pd.read_fwf(file_list[i], header=None)
main_dataframe = pd.concat([main_dataframe, df], axis = 0)
print(main_dataframe.head(30))
Output:
0 1
0 ID a123
1 Delivery_person_ID VADRES03DEL01
2 Delivery_person_Age 24.00
3 Delivery_person_Ratings 4.30
4 Name: 1, dtype: object NaN
0 ID b123
1 Delivery_person_ID VADRES03DEL02
2 Delivery_person_Age 22.00
3 Delivery_person_Ratings 4.10
4 Name: 2, dtype: object NaN
But I need the dataframe to be listed row wise for each person. For eg, in below format I want -
ID Delivery_person_ID Delivery_person_Age Delivery_person_Ratings
0 a123 VADRES03DEL01 24.00 4.30
1 b123 VADRES03DEL02 22.00 4.10