【发布时间】:2020-02-27 01:49:38
【问题描述】:
数据从第二次出现下划线开始被忽略,这应该被排序和重复需要被消除。
awk -F_ '{print $2}' file1 >> file 2; sort file1 | uniq ; i tried
******来自********
GGGGGGG DDDDD --> header
XYSER_YURTZ SUMOT_2_058A
XYSER_YURTZ SUMOT_2_058B
XYSER_YURTZ HJRIT_6_51A
XYSER_YURTZ HJRIT_6_51B
XYSER_YURTZ HJRIT_6_51C
XYSER_YURTZ HJRIT_6_51D
XYSER_YURTZ HJRIT_6_51E
XYSER_YURTZ HJRIT_6_51F
XYSER_YURTZ HJRIT_6_520
XYSER_YURTZ HJRIT_6_521
XYSER_GFRE SUMOT_2_16C3
XYSER_GFRE SUMOT_2_16C4
XYSER_GFRE SUMOT_2_16C5
XYSER_GFRE SUMOT_2_16C6
XYSER_GFRE SUMOT_2_16C7
XYSER_GFRE SUMOT_2_16C8
XYSER_GFRE SUMOT_2_16C9
XYSER_GFRE SUMOT_2_16CA
XYSER_GFRE SUMOT_2_16CB
XYSER_GFRE SUMOT_2_16CC
XYSER_GFRE SUMOT_2_16CD
XYSER_GFRE SUMOT_2_16CE
XYSER_GFRE SUMOT_2_16CF
XYSER_GFRE SUMOT_2_16D0
XYSER_GFRE SUMOT_2_16D1
XYSER_GFRE SUMOT_2_16D2
XYSER_GFRE SUMOT_2_16D3
XYSER_GFRE SUMOT_2_16D4
XYSER_GFRE HJRIT_6_12E1
XYSER_GFRE HJRIT_6_12E2
XYSER_GFRE HJRIT_6_12E3
XYSER_GFRE HJRIT_6_12E4
XYSER_GFRE HJRIT_6_12E5
XYSER_GFRE HJRIT_6_12E6
XYSER_GFRE HJRIT_6_12E7
XYSER_GFRE HJRIT_6_12E8
XYSER_GFRE HJRIT_6_12E9
XYSER_GFRE HJRIT_6_12EA
XYSER_GFRE HJRIT_6_12EB
XYSER_GFRE HJRIT_6_12EC
XYSER_GFRE HJRIT_6_12ED
XYSER_ALY1 XYSER_ALY1_0000
XYSER_ALY SUMOT_2_0497
XYSER_ALY SUMOT_2_0498
XYSER_BAP01 SUMOT_2_020E
到
**************输出1**************
GGGGGGG DDDDD
XYSER_YURTZ SUMOT_2
XYSER_YURTZ HJRIT_6
XYSER_GFRE SUMOT_2
XYSER_GFRE HJRIT_6
XYSER_ALY1 XYSER_ALY1
XYSER_ALY SUMOT_2
XYSER_BAP01 SUMOT_2
XYSER_BAP02 SUMOT_2
**************输出2**************
DDDDD GGGGGGG
SUMOT_2 XYSER_YURTZ
SUMOT_2 XYSER_GFRE
SUMOT_2 XYSER_ALY
SUMOT_2 XYSER_BAP01
SUMOT_2 XYSER_BAP02
HJRIT_6 XYSER_YURTZ
HJRIT_6 XYSER_GFRE
XYSER_ALY1 XYSER_ALY1
【问题讨论】:
-
适合显示示例数据、输出和代码!您能否将输入量减少到大约 10 行,并且您需要根据输入向我们展示所需输出的样本。请标记 int
required output(对吗?)。同时显示您当前的输出并描述问题(我认为您已经做到了)。祝你好运。 -
通过您的示例输入,您可以使用
sed 's/_[^_]*$//' inputfile|sort|uniq。这将删除最后一个下划线和所有后续字符。 -
@Bodo 谢谢它正在工作
标签: unix awk text-processing