【问题标题】:Drawing a FacetGrid of QQ-plots with seaborn使用 seaborn 绘制 QQ 图的 FacetGrid
【发布时间】:2016-11-09 21:15:27
【问题描述】:

我无法用seaborn 绘制QQ-plotsFacetGrid

我有一个 m 行(观察)和 n 列(特征)的矩阵,我想为每个特征(列)绘制一个 QQ 图,以将其与正态分布进行比较。

到目前为止,我的代码是这样的:

import scipy.stats as ss

def qqplots(fpath, expr, title):

    def quantile_plot(x, **kwargs):
        x = ss.zscore(x)
        qntls, xr = ss.probplot(x, dist="norm")
        plt.scatter(xr, qntls, **kwargs)

    expr_m = pd.melt(expr)
    expr_m.columns = ["Feature", "Value"]
    n_feat = len(expr_m["Feature"].value_counts().index)

    n_cols = int(np.sqrt(n_feat)) + 1

    g = sns.FacetGrid(expr_m, col="Feature", col_wrap=n_cols)
    g.map(quantile_plot, "Value");
    plt.savefig(fpath + ".pdf", bbox_inches="tight")
    plt.savefig(fpath + ".png", bbox_inches="tight")
    plt.close()

qqplots("lognorm_qqplot", np.log2(expr), "Log-normal qqplot")

expr 变量是一个带有 m 行(观察)和 n 列(特征)的 pandas DataFrame。

我得到的异常如下:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-52-f9333a55702e> in <module>()
     39     plt.close()
     40 
---> 41 qqplots("lognorm_qqplot", np.log2(expr), "Log-normal qqplot")

<ipython-input-52-f9333a55702e> in qqplots(fpath, expr, title)
     34 
     35     g = sns.FacetGrid(expr_m, col="Feature", col_wrap=n_cols)
---> 36     g.map(quantile_plot, "Value");
     37     plt.savefig(fpath + ".pdf", bbox_inches="tight")
     38     plt.savefig(fpath + ".png", bbox_inches="tight")

/usr/local/lib/python3.5/site-packages/seaborn/axisgrid.py in map(self, func, *args, **kwargs)
    726 
    727             # Draw the plot
--> 728             self._facet_plot(func, ax, plot_args, kwargs)
    729 
    730         # Finalize the annotations and layout

/usr/local/lib/python3.5/site-packages/seaborn/axisgrid.py in _facet_plot(self, func, ax, plot_args, plot_kwargs)
    810 
    811         # Draw the plot
--> 812         func(*plot_args, **plot_kwargs)
    813 
    814         # Sort out the supporting information

<ipython-input-52-f9333a55702e> in quantile_plot(y, **kwargs)
     25         y = ss.zscore(y)
     26         qntls, xr = ss.probplot(y, dist="norm")
---> 27         plt.scatter(xr, qntls, **kwargs)
     28 
     29     expr_m = pd.melt(expr)

/usr/local/lib/python3.5/site-packages/matplotlib/pyplot.py in scatter(x, y, s, c, marker, cmap, norm, vmin, vmax, alpha, linewidths, verts, edgecolors, hold, data, **kwargs)
   3249                          vmin=vmin, vmax=vmax, alpha=alpha,
   3250                          linewidths=linewidths, verts=verts,
-> 3251                          edgecolors=edgecolors, data=data, **kwargs)
   3252     finally:
   3253         ax.hold(washold)

/usr/local/lib/python3.5/site-packages/matplotlib/__init__.py in inner(ax, *args, **kwargs)
   1810                     warnings.warn(msg % (label_namer, func.__name__),
   1811                                   RuntimeWarning, stacklevel=2)
-> 1812             return func(ax, *args, **kwargs)
   1813         pre_doc = inner.__doc__
   1814         if pre_doc is None:

/usr/local/lib/python3.5/site-packages/matplotlib/axes/_axes.py in scatter(self, x, y, s, c, marker, cmap, norm, vmin, vmax, alpha, linewidths, verts, edgecolors, **kwargs)
   3838         y = np.ma.ravel(y)
   3839         if x.size != y.size:
-> 3840             raise ValueError("x and y must be the same size")
   3841 
   3842         s = np.ma.ravel(s)  # This doesn't have to match x, y in size.

ValueError: x and y must be the same size

【问题讨论】:

  • 'ss' 是全局变量还是模块?!
  • 糟糕,忘记添加了。它是scipy.stats。编辑感谢
  • @fbrundu 不是答案,但您可能想看看我是如何在这里实现的:phobson.github.io/mpl-probscale/tutorial/…

标签: python matplotlib plot statistics seaborn


【解决方案1】:

我实现了这一点,并且还更改了颜色以使用 Seaborn 调色板,代码如下:

def qqplots(fpath, expr, title):

    def quantile_plot(x, **kwargs):
        x = ss.zscore(x)
        ss.probplot(x, plot=plt)

    expr_m = pd.melt(expr)
    expr_m.columns = ["Feature", "Value"]
    n_feat = len(expr_m["Feature"].value_counts().index)

    n_cols = int(np.sqrt(n_feat)) + 1

    g = sns.FacetGrid(expr_m, col="Feature", col_wrap=n_cols)
    g.map(quantile_plot, "Value");
    for ax in g.axes:
        ax.get_lines()[0].set_markerfacecolor(sns.color_palette()[0])
        ax.get_lines()[1].set_color(sns.color_palette()[3])
    plt.savefig(fpath + ".pdf", bbox_inches="tight")
    plt.savefig(fpath + ".png", bbox_inches="tight")
    plt.close()

qqplots("lognorm_qqplot", np.log2(expr), "Log-normal qqplot")

【讨论】:

  • statsmodels.api 应用qqplot 时,此答案似乎中断。我得到一个空的绘图网格,然后是每个人qqplot
  • 我现在无法测试它。如果代码不再工作,请发布另一个答案。谢谢。
猜你喜欢
  • 2018-12-09
  • 2015-06-17
  • 2014-10-31
  • 1970-01-01
  • 1970-01-01
  • 2015-07-10
  • 2020-09-26
  • 1970-01-01
  • 2014-09-12
相关资源
最近更新 更多