Transform pandas dataframe to nested JSON

Question

I have a dataframe like below. My question is transforming this into a nested JSON structure like below.

class	code	hasAttribute
XYZ	ABC	Y
XYZ	BCD	N
XYY	CDE	Y

[
  {
    "class": 'XYZ',
    "series": [
      {
        'Code': 'ABC',
        'hasAttribute': 'Y'
      }, 
      {
        'Code': 'BCD',
        'hasAttribute': 'N'
      }
    ]
  },
  {
    "class": 'XYY',
    "series": [
      {
        'Code': 'CDE',
        'hasAttribute': 'Y'
      }
    ]
  }
]

Here series consists of 'Code' and 'hasAttribute' so each code has either 'Y' or 'N' attribute.

user17242583 · Accepted Answer · 2022-03-26 01:58:56Z

2

You can use groupby and then use to_dict to form each group accordingly:

lst = df.groupby('class').apply(lambda x: {'class': x['class'].unique()[0], 'series': x.drop('class', axis=1).to_dict('records')}).tolist()

Output:

>>> lst
[
  {
    'class': 'XYY',
    'series': [
      {
        'Code': 'CDE',
        'hasAttribute': 'Y'
      }
    ]
  },
  {
    'class': 'XYZ',
    'series': [
      {
        'Code': 'ABC',
        'hasAttribute': 'Y'
      },
      {
        'Code': 'BCD',
        'hasAttribute': 'N'
      }
    ]
  }
]

answered Mar 26, 2022 at 1:58

user17242583

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Transform pandas dataframe to nested JSON

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related