Color a heatmap in Python/Matplotlib according to requirement

Question

I'm trying to make a heatmap with a specified requirement of the coloring. I want to set an interval for the data and judge that as ok and color it green, the rest of the results should be colored as red. Does anyone have a clue of how to do this?? I have attache a simple example using pandas and matplotlib for better understanding.

import numpy as np 
from pandas import *
import matplotlib.pyplot as plt

Index= ['aaa', 'bbb', 'ccc', 'ddd', 'eee']
Cols = ['A', 'B', 'C', 'D']
data= abs(np.random.randn(5, 4))
df = DataFrame(data, index=Index, columns=Cols)

plt.pcolor(df)
plt.yticks(np.arange(0.5, len(df.index), 1), df.index)
plt.xticks(np.arange(0.5, len(df.columns), 1), df.columns)
plt.show()

example heatmap

Joe Kington · Accepted Answer · 2014-01-03 17:55:45Z

5

There's more than one way to do this.

The easiest way is to just pass in a boolean array to pcolor and then choose a colormap where green is high and red is low.

For example:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

Index= ['aaa', 'bbb', 'ccc', 'ddd', 'eee']
Cols = ['A', 'B', 'C', 'D']
data= np.random.random((5, 4))
df = pd.DataFrame(data, index=Index, columns=Cols)

plt.pcolor(df > 0.5, cmap='RdYlGn')
plt.yticks(np.arange(0.5, len(df.index), 1), df.index)
plt.xticks(np.arange(0.5, len(df.columns), 1), df.columns)
plt.show()

enter image description here

Alternately, as @Cyber mentioned, you could make a two-color colormap based on your values and use it:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib.colors as mcolors

Index= ['aaa', 'bbb', 'ccc', 'ddd', 'eee']
Cols = ['A', 'B', 'C', 'D']
data= np.random.random((5, 4))
df = pd.DataFrame(data, index=Index, columns=Cols)

# Values from 0-0.5 will be red and 0.5-1 will be green
cmap, norm = mcolors.from_levels_and_colors([0, 0.5, 1], ['red', 'green'])

plt.pcolor(df, cmap=cmap, norm=norm)
plt.yticks(np.arange(0.5, len(df.index), 1), df.index)
plt.xticks(np.arange(0.5, len(df.columns), 1), df.columns)
plt.show()

enter image description here

(The color difference is just because the "RdYlGn" colormap uses darker greens and reds as its endpoints.)

On a side note, it's also considerably faster to use pcolormesh for this, rather than pcolor. For small arrays, it won't make a significant difference, but for large arrays pcolor is excessively slow. imshow is even faster yet, if you don't mind raster output. Use imshow(data, interpolation='nearest', aspect='auto', origin='lower') to match the defaults of pcolor and pcolormesh.

edited Jan 3, 2014 at 17:55

answered Jan 3, 2014 at 17:28

Joe Kington

287k73 gold badges621 silver badges474 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Paul H Over a year ago

side note: matplotlib can infer colormaps from strings alone. i.e., plt.pcolor(df > 0.5, cmap='RdYlGn'). saves some typing, i guess.

Joe Kington Over a year ago

@PaulH - Good point! Thanks! I always tend to remember plt.get_cmap for some reason, but I'd forgotten that it wasn't necessary.

Cory Kramer · Accepted Answer · 2014-01-03 17:18:02Z

0

You can make a 2 color colormap. Then you can set the cutoff value between red and green.

answered Jan 3, 2014 at 17:18

Cory Kramer

119k19 gold badges176 silver badges233 bronze badges

1 Comment

jonas Over a year ago

I'm new to matplotlib, could you show me an example reusing my code?

Collectives™ on Stack Overflow

Color a heatmap in Python/Matplotlib according to requirement

2 Answers 2

2 Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related