存储在R中列表中的矩阵的列矩阵乘法？答案

【问题标题】：Columnwise matrix multiplication of matrices stored inside lists in R?存储在R中列表中的矩阵的列矩阵乘法？
【发布时间】：2018-12-08 22:14:12
【问题描述】：

我有两个存储在具有相同维度的列表（L1、L2）中的大型数据集。在每个列表元素 L[[.]] 中，都有一个数据框。我想将两个列表逐列相乘，然后得到 L1[[i]] 所有组合的结果。 L2[[j]] 表示 i 和 j 可能取的任何值。

有关说明，请参见下面的代码

#Data Generation-----------------------
set.seed(200)
a = matrix(sample(50,16), 4)
b = matrix(sample(50,16), 4)
c = matrix(sample(50,16), 4)
d = matrix(sample(50,16), 4)

#Creating index and binding------------
r = cbind(rep(1,4),a)
s = cbind(rep(2,4),b)
o = as.data.frame(rbind(r,s))
t = cbind(rep(1,4),c)
u = cbind(rep(2,4),d)
p = as.data.frame(rbind(t,u))

#Splitting the data -------------------
list.1 = split(o, o$V1) #o$V1 is the index column
list.2 = split(p, p$V1) #o$V1 is the index column

现在，我想在 R 中重现 Excel 函数 SUMPRODUCT。

因此，例如，从上面的数据中，我想要一个简单的数据框，如下所示：

#Col1      #Col2
sum(a*c)   sum(b*c)
sum(a*d)   sum(b*d)

问题在于 a、b、c 和 d 现在都在列表中。我曾尝试使用 for-loops，但无济于事。我尝试使用 split-apply-combine 方法，但也没有成功。

由于两个列表各有 9000 个元素，我正在寻找一个优化的方法来做到这一点。我怎样才能做到这一点？

谢谢。

编辑：根据 userR 的要求扩展每个列表中 3 个元素的示例

#Data Generation-----------------------
set.seed(200)
a = matrix(sample(50,16), 4)
b = matrix(sample(50,16), 4)
c = matrix(sample(50,16), 4)
d = matrix(sample(50,16), 4)
e = matrix(sample(50,16), 4)
f = matrix(sample(50,16), 4)

#Creating index and binding------------
r = cbind(rep(1,4),a)
s = cbind(rep(2,4),b)
m = cbind(rep(3,4),c)
o = as.data.frame(rbind(r,s,m))

t = cbind(rep(1,4),d)
u = cbind(rep(2,4),e)
v = cbind(rep(3,4),f)
p = as.data.frame(rbind(t,u,v))

#Splitting the data -------------------
list.1 = split(o, o$V1) #o$V1 is the index column
list.2 = split(p, p$V1) #o$V1 is the index column

现在每个列表都有 3 个元素，每个元素都包含一个 4x4 矩阵。

我正在寻找的结果结构如下：

#Col1      #Col2      #Col3
sum(a*d)   sum(b*d)   sum(c*d)
sum(a*e)   sum(b*e)   sum(c*e)
sum(a*f)   sum(b*f)   sum(c*f)

对于我真正的问题，结果将是一个包含 9000 列和 9000 行的数组（矩阵或数据框）。

编辑：按照 userR 的建议添加真实数据

用户建议我展示一些真实数据，以便人们知道我的真实数据是什么样的：

dput(list.1[1:3])

list(`1` = structure(list(vol = c(1425.76, 272.52, 0, 0, 31912.78, 
6056.18, 8212.88, 3909.3, 0, 761.06, 22.45, 237.18), i_1 = c(2819.81, 
4026.72, 827.2, 4790.52, 12218.1, 3632.64, 6308.66, 4076.71, 
2192.98, 952.94, 112.84, 170.97), i_2 = c(2857.88, 2914.34, 761.87, 
4412.4, 11046.36, 2363.24, 7761.31, 5431.03, 1337.62, 857, 103.46, 
110.33), i_3 = c(1389.12, 932.86, 238.51, 5046, 5298.57, 3087.9, 
8746.02, 7129.57, 708.53, 549.1, 86.58, 163.15), i_4 = c(1626.96, 
936.04, 377.81, 4909.62, 6323.5, 2766.49, 3746.06, 2858.07, 900.29, 
975.21, 102.76, 295.1), i_5 = c(1653.05, 1724.74, 321.59, 3937.2, 
6966.48, 2614.67, 3326.99, 2371.44, 1082.43, 970.25, 123.51, 
491.92), i_6 = c(1584.14, 3399.31, 392.24, 3957.88, 8042.5, 2614.46, 
2371.67, 1896.1, 1201.83, 1314.06, 161.23, 892.91)), row.names = c(NA, 
12L), class = "data.frame"), `2` = structure(list(vol = c(10774.34, 
287.53, 0, 0, 57507.79, 10692.91, 9028.38, 10355.78, 8900.38, 
3253.59, 22.45, 219), i_1 = c(5760.16, 4315.77, 585.28, 2886.11, 
23767.55, 3095.39, 6705.94, 6445.96, 10612.49, 2470.32, 126.65, 
143.46), i_2 = c(5035.23, 1785.77, 405.05, 4492.64, 21509.39, 
3654.16, 10203.03, 9505.1, 6628.42, 1298.06, 111.76, 110.13), 
    i_3 = c(2798.54, 1920.72, 464.92, 7916.61, 13628.15, 8365.88, 
    18425.9, 22368.93, 2253.38, 1078.65, 101.34, 134.98), i_4 = c(2344.65, 
    1407.02, 369.7, 2889.69, 7618.72, 2110.01, 4982.27, 2250.94, 
    1744.4, 1033.89, 105.74, 212.26), i_5 = c(1653.05, 1724.74, 
    321.59, 3937.2, 6966.48, 2614.67, 3326.99, 2371.44, 1082.43, 
    970.25, 123.51, 491.92), i_6 = c(1584.14, 3399.31, 392.24, 
    3957.88, 8042.5, 2614.46, 2371.67, 1896.1, 1201.83, 1314.06, 
    161.23, 892.91)), row.names = 13:24, class = "data.frame"), 
    `3` = structure(list(vol = c(850.15, 218.58, 0, 0, 38959.27, 
    3081.31, 3441.35, 2760.54, 0, 2826.8, 0, 34.12), i_1 = c(6048.28, 
    3545.14, 1566.05, 2866.46, 20149.24, 1459.03, 2051.68, 2047.74, 
    5059.57, 3369.86, 129.37, 361.49), i_2 = c(1728.12, 1530.14, 
    364.37, 4761.97, 6934.24, 1802.96, 5394.15, 3972.96, 510.25, 
    989.3, 109.05, 322.37), i_3 = c(1182.59, 750.55, 311.19, 
    5540.96, 4544.96, 2535.07, 8926.35, 7209.61, 423.66, 446.62, 
    92.21, 282.36), i_4 = c(1179.1, 645.18, 283.97, 4616.66, 
    5063.41, 3110.14, 9240.41, 4752.86, 744.85, 648.44, 100.52, 
    311.67), i_5 = c(1653.05, 1724.74, 321.59, 3937.2, 6966.48, 
    2614.67, 3326.99, 2371.44, 1082.43, 970.25, 123.51, 491.92
    ), i_6 = c(1584.14, 3399.31, 392.24, 3957.88, 8042.5, 2614.46, 
    2371.67, 1896.1, 1201.83, 1314.06, 161.23, 892.91)), row.names = 25:36, class = "data.frame"))

dput(list.2[1:3])

list(`1` = structure(list(Index = c(1L, 1L, 1L, 1L, 1L, 1L, 1L,  1L,
1L, 1L, 1L, 1L), itenm1 = c(998399, 998399, 998399, 998399,  998399,
998399, 998399, 998399, 998399, 998399, 998399, 998399 ), j_1 =
c(-261.62831, -605.82802, -190.35225, -802.27542, -835.07636, 
-709.70814, -444.26492, -207.96871, -986.93606, -968.29324, -7675.97567, 
-1271.43424), j_2 = c(0, -188.67302, 0, -799.17034, 0, 247.70379,  0, 0, 1051.71715, -27.94787, 0, -13.34628), j_3 = c(0, 0, 0,  0, 0,
-207.58736, 0, 0, -2333.43115, -1346.57579, 0, -205.13053 ), j_4 = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0), j_5 = c(0, 0,  0, 0, 0, 0, 0,
0, 0, 0, 0, 0), j_6 = c(0, 0, 0, 0, 0, 0, 0, 0,  0, 0, 0, 0)),
row.names = c(NA, 12L), class = "data.frame"), 
    `2` = structure(list(Index = c(2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L), itenm1 = c(998398, 998398, 998398, 998398, 
    998398, 998398, 998398, 998398, 998398, 998398, 998398, 998398
    ), j_1 = c(-106.64606, -203.78915, -76.30121, -310.10454, 
    -321.62536, -227.3462, -160.82221, -70.87354, -286.94001, 
    -137.28382, -3779.42484, -604.71574), j_2 = c(0, -96.94433, 
   0, -297.21757, 0, 67.67053, 0, 0, 309.38773, -8.42931, 0, 
   -6.7299), j_3 = c(0, 0, 0, 0, 0, -56.71107, 0, 0, -686.43453, 
   -406.13843, 0, -103.43761), j_4 = c(0, 0, 0, 0, 0, 0, 0, 
   0, 0, 0, 0, 0), j_5 = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 
   0), j_6 = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0)), row.names = 13:24, class = "data.frame"), 
   `3` = structure(list(Index = c(3L, 3L, 3L, 3L, 3L, 3L, 3L, 
   3L, 3L, 3L, 3L, 3L), itenm1 = c(998397, 998397, 998397, 998397, 
   998397, 998397, 998397, 998397, 998397, 998397, 998397, 998397
   ), j_1 = c(-238.10472, -543.97414, -71.04739, -756.58841, 
   -782.7918, -667.84871, -424.38314, -193.82405, -638.12855, 
   -319.65804, -6693.88425, -1189.81911), j_2 = c(0, -182.11783, 
   0, -750.99738, 0, 233.80836, 0, 0, 683.61007, -18.48993, 
   0, -11.46144), j_3 = c(0, 0, 0, 0, 0, -195.94234, 0, 0, -1516.71678, 
   -890.87644, 0, -176.16082), j_4 = c(0, 0, 0, 0, 0, 0, 0, 
   0, 0, 0, 0, 0), j_5 = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 
   0), j_6 = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0)), row.names = 25:36, class = "data.frame"))

【问题讨论】：

你好，用户R。首先，非常感谢。我正在测试你给我的选项。这可能需要一些时间，因为我不是一个快速的编码器并且花时间理解每一行。现在，我正在测试 sumprod_mapply。由于我的列表的实际尺寸不同，我很想了解rep 命令中每个数字的含义。如果解决了，我会将您的答案标记为解决方案。
第一个问题，是的。我将只有两个列表，但在列表的每个元素中都有一个矩阵。每个列表有 9000 个元素，所以它们都有 9000 个矩阵。我想将 (1,1)、(1,2)、(1,3)...(9000,1)...(9000,9000) 中的所有可能的矩阵组合相乘。我的输出预计有 9000 行和 9000 列。将 #s 列视为第一个矩阵的元素的索引，每列 M 将有 9000 行，表示sum(M[#]*N[#]，其中 N[#] 是列表 2 的元素组。
也许这个不太完善的图将有助于理解我的想法。 imgur.com/a/DGRUmlM
查看我更新的解决方案是否符合您的预期输出以及它是否适用于您的实际问题。 Rcpp 方法应该是最快的。
尊敬的用户，再次感谢您的回答。我有几天不在办公室，但我会在 Rcpp 开始工作后立即查看您的答案。我现在遇到了一些麻烦。当我解决它时，我会运行你的答案。再次感谢。

标签： r list apply matrix-multiplication

【解决方案1】：

下面我提出三种方法及其基准：mapply、嵌套for 循环和RcppArmadillo。

数据：

#Data Generation-----------------------
set.seed(200)
a = matrix(sample(50,16), 4)
b = matrix(sample(50,16), 4)
c = matrix(sample(50,16), 4)
d = matrix(sample(50,16), 4)
e = matrix(sample(50,16), 4)
f = matrix(sample(50,16), 4)

#Creating index and binding------------
r = cbind(rep(1,4),a)
s = cbind(rep(2,4),b)
m = cbind(rep(3,4),c)
o = as.data.frame(rbind(r,s,m))

t = cbind(rep(1,4),d)
u = cbind(rep(2,4),e)
v = cbind(rep(3,4),f)
p = as.data.frame(rbind(t,u,v))

#Splitting the data -------------------
list.1 = split(o[,-1], o$V1) #o$V1 is the index column
list.2 = split(p[,-1], p$V1) #o$V1 is the index column

list.1 = lapply(list.1, as.matrix)
list.2 = lapply(list.2, as.matrix)

在这里，我先删除 id 列并将list.1 和list.2 的每个元素转换为矩阵，这有点作弊。这样做可以提高我们函数的性能。

初始化函数：

# Rcpp -----------------------------------------------
library(Rcpp)
library(RcppArmadillo)

cppFunction(depends = "RcppArmadillo",
"arma::mat sumprod_Rcpp(List x, List y){
  List xlist(x);
  List ylist(y);
  int n = xlist.size();
  arma::mat m(n,n);

  for(int i=0; i<n; i++) {
    for(int j=0; j<n; j++){
      arma::mat xMat = xlist[i];
      arma::mat yMat = ylist[j];
      arma::vec v = arma::vectorise(xMat*yMat);
      m(j,i) = sum(v);
    }
  }
  return(m);
}
"
)

# Nested For -----------------------------------------
sumprod_for <- function(x, y){
  mat <- matrix(NA,length(list.1),length(list.1))

  for(i in 1:length(list.1)){
    for(j in 1:length(list.1)){
      mat[j,i] <- sum(x[[i]] %*% y[[j]])
    }
  }
  return(mat)
}

# Mapply ---------------------------------------------
sumprod_mapply <- function(x, y){
  matrix(mapply(function(j, k){
    sum(x[[j]] %*% y[[k]])
  }, 
  rep(1:length(list.1), each = length(list.1)), 
  rep(1:length(list.1), length(list.1))
  ), 
  length(list.1), 
  length(list.1)
  )
}  

# Ryan's sapply --------------------------------------
sumprod_sapply <- function(x, y){
  sapply(x, function(j){
    lapply(y, function(k) sum(j %*% k))
  })
}

检查输出是否相同：

identical(sumprod_mapply(list.1, list.2), matrix(unlist(sumprod_sapply(list.1, list.2)), length(list.1), length(list.1)))
# [1] TRUE
identical(sumprod_mapply(list.1, list.2), sumprod_for(list.1, list.2))
# [1] TRUE
identical(sumprod_mapply(list.1, list.2), sumprod_Rcpp(list.1, list.2))
# [1] TRUE

sumprod_Rcpp(list.1, list.2)
#       [,1]  [,2]  [,3]
# [1,] 44882 40505 49670
# [2,] 29750 26897 32260
# [3,] 45898 41248 50847

sumprod_for(list.1, list.2)
#       [,1]  [,2]  [,3]
# [1,] 44882 40505 49670
# [2,] 29750 26897 32260
# [3,] 45898 41248 50847

sumprod_mapply(list.1, list.2)
#       [,1]  [,2]  [,3]
# [1,] 44882 40505 49670
# [2,] 29750 26897 32260
# [3,] 45898 41248 50847

sumprod_sapply(list.1, list.2)
#   1     2     3    
# 1 44882 40505 49670
# 2 29750 26897 32260
# 3 45898 41248 50847

基准测试：

library(microbenchmark)
microbenchmark(sumprod_mapply(list.1, list.2), 
               sumprod_sapply(list.1, list.2),
               sumprod_for(list.1, list.2),
               sumprod_Rcpp(list.1, list.2),
               times = 10000L)

结果：

Unit: microseconds
                           expr    min     lq      mean median     uq      max neval
 sumprod_mapply(list.1, list.2) 34.345 39.082 51.274501 41.846 62.373 2448.292 10000
 sumprod_sapply(list.1, list.2) 37.108 42.635 56.119414 45.398 67.504 2324.733 10000
    sumprod_for(list.1, list.2) 10.264 13.422 17.685540 15.001 22.502  120.008 10000
   sumprod_Rcpp(list.1, list.2)  2.369  3.948  5.247494  4.738  6.317   88.032 10000

在寻找性能提升时，Rcpp 实现不会出错。但令人惊讶的是，sumprod_for 比sumprod_mapply 和sumprod_sapply 快得多，这可能是因为mapply 隐式地将列表输出强制转换为向量。随意提出更多解决方案，我会将它们添加到基准测试中。

【讨论】：

您好，用户。感谢您的回答。在我的真实案例中，我的列表每个都有 9000 个元素，每个元素都是一个矩阵。我想进一步澄清rep(1:2, each = 2), rep(1:2, 2)), 2, 2) 行。我假设两个rep 命令都在那里，所以你有两次 (1,1,2,2)，然后你可以将它组合为 (1,1)(1,2),(2,1),(2, 2）。但是2, 2) 是干什么用的？如果这个函数是为我原来的列表编写的？
@userR，我刚刚添加了一个扩展示例，每个列表有 3 个矩阵，每个矩阵在一个元素中
sapply(list.1, function(x) lapply(list.2, function(y) sum(x %*% y))) 给出了相同的结果，尽管它可能比 for 循环慢。
我暂时离开了 Rcpp 代码段并尝试使用 for 循环。尝试运行后，出现错误。 Error in x[[i]] %*% y[[j]] : requires numeric/complex matrix/vector arguments。我试过as.matrix()x和y，但没有成功。
@MasonBeau 请删除这些 cmets 并将它们再次发布到您的问题正文中。这些信息首先应该包含在您的问题本身中。