示例文件是 8 位(不是 10 位),存储格式比较棘手。
该工具允许您选择格式。
合适的格式如下:
帧分为两个场 - 上场和下场(隔行格式)。
每个文件的分辨率为 176x72。
因为格式是YUV420,所以U和V字段的大小都是88x36。
代码示例使用以下阶段:
- 读取 Y、U 和 V 的高位字段(每个元素 8 位)。
- 读取 Y、U 和 V 的低场。
- 交错上场和下场。
- 将 U 和 V 上采样到 Y 的大小。
- 将 YUV 转换为 RGB(使用现有的 MATLAB 函数
ycbcr2rgb)。
以下代码示例读取第一帧并转换为 RGB:
fname = 'tulips_yuv420_inter_planar_qcif.yuv';
width = 176;
height = 144;
fid = fopen(fname, 'r'); % Open the video file
Y0 = (fread(fid, [width, height/2], 'uint8'))'; %Read upper field of Y plane
U0 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of Y plane
V0 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read upper field of U plane
Y1 = (fread(fid, [width, height/2], 'uint8'))'; %Read upper field of Y plane
U1 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of U plane
V1 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of V plane
fclose(fid);
%Interleave upper and lower fields
Y = zeros(height, width);
Y(1:2:end, :) = Y0;
Y(2:2:end, :) = Y1;
U = zeros(height/2, width/2);
U(1:2:end, :) = U0;
U(2:2:end, :) = U1;
V = zeros(height/2, width/2);
V(1:2:end, :) = V0;
V(2:2:end, :) = V1;
U = imresize(U, size(Y), 'bicubic');
V = imresize(V, size(Y), 'bicubic');
YUV = cat(3, Y, U, V);
%Convert YUV to RGB (MATLAB function ycbcr2rgb uses BT.601 conversion formula).
RGB = ycbcr2rgb(uint8(YUV));
figure,imshow(RGB)
结果:
读取 10 位 YUV420:
假设:
- 每个 10 位组件存储在 2 个字节中(没有“位打包”)。
- 数据存储在每个字节的下部(每个
uint16 元素的值都在[0, 1023] 范围内)。
- 存储格式是与
uint8 示例相同的非标准interlace 格式。
从 8 位样本构建 10 位 YUV420 样本文件(单帧测试):
以下代码从 8 位样本中构建了一个 10 位样本(将范围从存储在 uint8 中的 8 位扩展到存储在 uint16 中的 10 位)。
fname = 'tulips_yuv420_inter_planar_qcif.yuv';
width = 176;
height = 144;
fid = fopen(fname, 'r'); % Open the video file
Y0 = (fread(fid, [width, height/2], 'uint8'))'; %Read upper field of Y plane
U0 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of Y plane
V0 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read upper field of U plane
Y1 = (fread(fid, [width, height/2], 'uint8'))'; %Read upper field of Y plane
U1 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of U plane
V1 = (fread(fid, [width/2, height/4], 'uint8'))'; %Read lower field of V plane
fclose(fid);
fid = fopen('10bits__tulips_yuv420_inter_planar_qcif.yuv', 'w'); % Open for writing
fwrite(fid, uint16(Y0'*(1023/255)), 'uint16'); %1023 = 2^10-1, and 255 = 2^8-1
fwrite(fid, uint16(U0'*(1023/255)), 'uint16');
fwrite(fid, uint16(V0'*(1023/255)), 'uint16');
fwrite(fid, uint16(Y1'*(1023/255)), 'uint16');
fwrite(fid, uint16(U1'*(1023/255)), 'uint16');
fwrite(fid, uint16(V1'*(1023/255)), 'uint16');
fclose(fid);
读取 10 位 YUV420
以下代码读取 10 位 YUV420 的单帧(假设匹配列表):
fname = '10bits__tulips_yuv420_inter_planar_qcif.yuv';
width = 176;
height = 144;
fid = fopen(fname, 'r'); % Open the video file
Y0 = (fread(fid, [width, height/2], 'uint16'))'; %Read upper field of Y plane
U0 = (fread(fid, [width/2, height/4], 'uint16'))'; %Read lower field of Y plane
V0 = (fread(fid, [width/2, height/4], 'uint16'))'; %Read upper field of U plane
Y1 = (fread(fid, [width, height/2], 'uint16'))'; %Read upper field of Y plane
U1 = (fread(fid, [width/2, height/4], 'uint16'))'; %Read lower field of U plane
V1 = (fread(fid, [width/2, height/4], 'uint16'))'; %Read lower field of V plane
fclose(fid);
%Interleave upper and lower fields
Y = zeros(height, width);
Y(1:2:end, :) = Y0;
Y(2:2:end, :) = Y1;
U = zeros(height/2, width/2);
U(1:2:end, :) = U0;
U(2:2:end, :) = U1;
V = zeros(height/2, width/2);
V(1:2:end, :) = V0;
V(2:2:end, :) = V1;
U = imresize(U, size(Y), 'bicubic');
V = imresize(V, size(Y), 'bicubic');
YUV = cat(3, Y, U, V);
%Convert elements range from [0, 1023] to range [0, 1] (MATLAB function ycbcr2rgb supports doubles in range [0, 1]).
YUV = YUV/1023; %1023 applies 10 bits range. 2^10-1 = 1023
%Convet YUV to RGB (MATLAB function ycbcr2rgb uses BT.601 conversion formula).
RGB = ycbcr2rgb(YUV);
%Convert from double to uint8 (from range [0, 1] to range [0, 255]).
RGB = im2uint8(RGB);
figure,imshow(RGB)
注意:
代码 YUV = YUV/1023 将“10 位”格式转换为 [0, 1] double 格式。
使用转换是因为ycbcr2rgb 不支持 10 位输入。
计算文件的大小:
你是对的:“一帧的大小等于 1.5*Y 的大小”。
假设 10 位分量存储在 2 个字节中,Y 的大小为宽*高*2,一帧的大小为宽*高*3。