Está en la página 1de 30

Gii thiu nhm v mn hc

K thut nn Video s
+ Gii thiu chung +M hnh nn nh +Cc tham s hnh nh +L thuyt thng tin Entropy +Cc phng php nn +M RLC +M Shannon Fano +M Huffman +Nn trong nh +Nn lin kt nh +Chun JPEG +Chun MPEG +Chun MPEG - 2

Vi nt v k thut nn video dng MPEG


Ni chung, tn hiu video thng cha ng mt lng ln cc thng tin tha, chng thng c chia thnh hai loi: tha tnh bn trong tng frame (statistical) v tha ng gia cc frame (subjective). Mc ch ca nn video l nhm lm gim s bit khi lu tr v khi truyn bng cch pht hin loi b cc lng thng tin d tha ny v dng cc k thut Entropy m ho ti thiu ho lng tin quan trng cn gi li.

GII THIU
K thut nn nh s ang ng mt vai tr cc k quan trng trong cc h thng vin thng v multimedia gii quyt vn bng thng ca ng truyn. Cc k thut nn video u c gng lm gim lng thng tin cn thit cho mt chui cc bc nh m khng lm gim cht lng ca n i vi ngi xem. Ni chung, tn hiu video thng cha ng mt lng ln cc thng tin tha, chng thng c chia thnh hai loi: tha tnh bn trong tng frame (statistical) v tha ng gia cc frame (subjective). Mc ch ca nn video l nhm lm gim s bit khi lu tr v khi truyn bng cch pht hin loi b cc lng thng tin d tha ny v dng cc k thut Entropy m ho ti thiu ho lng tin quan trng cn gi li.

Nn khng mt d liu
i vi dng nn khng mt d liu, nh c khi phc hon ton ging nh gc, tuy nhin iu ny i hi phi c thit b lu tr v ng truyn ln hn. Cc thut ton ca nn khng mt d liu thng da vo vic thay th mt nhm cc k t trng lp bi mt nhm cc k t c bit khc ngn hn m khng quan tm ti ngha ca dng bit d liu. Cc v d ca dng nn khng mt d liu l Run-Length Encoding (RLE), Huffman Coding, Arithmetic coding, Shannon-Fano Coding, LZ78, LZH, LZW....

Nn mt d liu
i vi dng nn c mt d liu, nh c khi phc khng ging hon ton vi nh gc, dng nn ny thch hp cho vic lu tr v truyn nh tnh, video qua mt mng c bng thng hn ch. Cc dng nn ny thng cho h s nn cao hn, n lin quan ti vic dng cc php bin i tn hiu t min ny sang min khc. Cc v d ca bin i c mt d liu gm: Differential Encoding, Discrete Cosine Transform(DCT), Vector Quantization, JPEG (Joint Photographic Experts Group) v MPEG (Motion Picture Experts Group).

Cc phng php nn nh c mt tn hiu gm c 4 bc

nh gc c bin i theo nhiu cch khc nhau. Vo nhng nm 1980, vic nn v gii nn tn hiu video da trn k thut DPCM (differential pulse code modulation) c CCITT chun ho theo tiu chun H.120. Cc phng php nn dng DPCM da trn nguyn tc pht hin s ging nhau v khc nhau gia cc im nh (pixels) gn nhau tm cch loi b cc thng tin tha. Tuy nhin, cht lng nh ng khng t c cc yu cu cn thit. ci thin cht lng nh ng m khng lm tng s lng bit so vi yu cu, k thut m ho chuyn sang dng cc php bin i m chng c th x l ng thi mt nhm cc pixels v ta c khi nim v cc b m ho trn cc khi (block-based codecs). i vi cc b m ho trn cc khi nh, mi im nh (pixel) s cn t hn 1 bit m ho.

Cc b m ho khi c th da trn hai nguyn tc bin i c bn: Discrete Cosine Transform (DCT) v Vector Quantization (VQ). DCT c dng bin i cc khi nh hai chiu c kch thc 8X8 t min khng gian sang min tn s. Bin i DCT l tng t nh bin i DFT (Discrete Fourier Trransform). Cc h s DCT nhn c s c lng t ho (Quantization) v m ho (Encode). Cc h s DCT nhn c s c lng t ho (Quantisation coding) thnh tp cc h s n gin hn na. Mc ch ca n l lm gim hn na s bit c trng cho mt h s. Ti b m ho s c mt bng m (code book) v bng cc ch s ni b, t c th chn c cc t m (code word) tng ng mt cch tt nht cho tp cc h s c to ra. Qu trnh lng t ho cng ng thi lm trn gi tr ca cc h s mc nh hn, y chnh l nguyn nhn gy ra mt tn hiu, tuy vy nh c khi phc t cht lng mc c th chp nhn c i vi ngi xem.

Trong phng php VQ, bc nh c chia thnh cc khi c kch thc c nh, mt bng m (code book) c xy dng vi cc ch s tng ng vi cc khi nh ny. Nh vy, thay cho vic phi truyn ln lt cc khi ca bc nh, ta ch cn truyn cc ch s tng ng ca cc khi nh hoc ch s tng ng gn ng nht so vi cc khi nh cn truyn. Hai phng php ny cho kt qu khng khc nhau nhiu v cht lng nn nh ng, tuy nhin ngy nay bin i DCT t ra c ng dng rng ri hn trong cc s nn v gii nn cc bc nh tnh (theo tiu chun JPEG) v x l nh ng (theo tiu chun ca MPEG).

NN TN HIU NH DNG MPEG


MPEG (Moving Picture Expert Group) c ra i vo nm 1988 nhm mc ch chun ho cho nn tn hiu m thanh v video. MPEG - 1 c th nn tn hiu video ti 1.5Mbit/s vi cht lng VHS v m thanh lp th (stereo audio) vi tc 192 bit/s. N c dng lu tr video v m thanh trn CD-ROM.

Vo nhng nm 1990, MPEG-2 ra i nhm p ng cc tiu chun nn video cho truyn hnh. MPEG-2 c kh nng m ho tn hiu truyn hnh tc 3-15Mbit/s v truyn hnh nt cao tc ti 15-30Mbit/s. MPEG-2 cho php m ho tn hiu video vi nhiu mc phn gii khc nhau, chng c kh nng p ng cho nhiu ng dng khc nhau. Nhiu thut ton tng ng vi nhiu cc ng dng khc nhau pht trin v c tp hp li thnh mt b tiu chun y ca MPEG. Vic p dng ton b cc c im ca chun MPEG-2 trong tt c cc b m ho v gii m l khng cn thit do s phc tp ca thit b cng nh s tn km v di thng ca ng truyn V vy trong hu ht cc trng hp ta ch s dng mt phn nht nh trong ton b cc c im ca chun MPEG-2, chng thng c gi l profiles v levels. Mt profile s xc nh mt thut ton (iu chnh bitstream v phn gii mu) v mt level s xc nh mt s tiu ch bt buc cho cc tham s ca bc nh (v d nh kch thc nh v s lng bit).

MPEG-4 tr thnh mt tiu chun cho nn nh k thut truyn hnh s, cc ng dng v ho v video tng tc hai chiu (games, videoconferencing) v cc ng dng multimedia tng tc hai chiu (World Wide Web hoc cc ng dng nhm phn pht d liu video nh truyn hnh cp, Internet video...) vo nm 1999. Ngy nay, MPEG-4 tr thnh mt tiu chun cng ngh trong qu trnh sn xut, phn phi v truy cp vo cc h thng video. N gp phn gii quyt vn v dung lng cho cc thit b lu tr, gii quyt vn v bng thng ca ng truyn tn hiu video hoc kt hp c hai vn trn.

MPEG khng phi l mt cng c nn n l m u im ca nn nh dng MPEG chnh l ch MPEG c mt tp hp cc cng c m ho chun, chng c th c kt hp vi nhau mt cch linh ng phc v cho mt lot cc ng dng khc nhau.

Nn MPEG l s kt hp hi ho ca bn k thut c bn: Tin x l (Preprocessing), on trc s chuyn ng ca cc frame b m ho (temporal prediction), b chuyn ng b gii m (motion compensation) v m lng t ho (quatisation coding). Cc b lc tin x l s lc ra nhng thng tin khng cn thit t tn hiu video v nhng thng tin kh m ho nhng khng quan trng cho s cm th ca mt ngi. K thut on chuyn ng da trn nguyn tc l cc nh trong chui video dng nh c lin quan mt thit vi nhau theo thi gian: Mi frame ti mt thi im nht nh s c nhiu kh nng ging vi cc frame ng ngay pha trc v ngay pha sau n. Cc b m ho s tin hnh qut ln lt tng phn nh trong mi frame gi l macro blocks, sau n s pht hin macro block no khng thay i t frame ny ti frame khc. B m ho s tin on trc s xut hin ca cc macro blocks khi bit v tr v hng chuyn ng ca n. Do ch nhng s thay i gia cc khi trong frame hin ti (motion compesated residual) v cc khi c tin on mi c truyn ti bn pha thu. Pha bn thu tc b gii m lu tr sn nhng thng tin m khng thay i t frame ny ti frame khc trong b nh m ca n v chng c dng in thm mt cch u n vo cc v tr trng trong nh c khi phc.

Nh chng ta u bit, nn tn hiu video c thc hin nh vic loi b c s d tha v khng gian (spatial coding) v thi gian (temporal coding). Trong MPEG, vic loi b d tha v thi gian (nn lin nh) c thc hin trc ht nh s dng cc tnh cht ging nhau gia cc nh lin tip (Inter-frame techniques). Chng ta c th s dng tnh cht ny to ra cc bc nh mi nh vo nhng thng tin t nhng nh gi trc n (predicted). Do vy pha b m ho, ta ch cn gi nhng bc nh c thay i so vi nhng nh trc, sau ta li dng phng php nn v khng gian loi b s d tha v khng gian trong chnh bc nh sai khc ny. Nn v khng gian da trn nguyn tc l pht hin s ging nhau ca cc im nh (pixels) ln cn nhau (Intra-frame coding techniques). JPEG ch p dng phng php nn theo khng gian v n c thit k x l v truyn cc nh tnh. Tuy nhin nn tn hiu theo phng php ca JPEG cng c th c dng nn cc bc nh mt cch c lp trong dy tn hiu video. ng dng ny thng c gi l JPEG ng (Motion JPEG). Trong mt chu k gi mt dy cc bc nh theo kiu JPEG ng, nh u tin c nn nh s loi b d tha v khng gian, sau cc nh tip theo c nn nh s loi b d tha v thi gian (nn lin nh). Qu trnh c lp i lp li cho mt dy cc bc nh trong tn hiu video.

Thut ton nn MPEG cng da trn php bin i DCT cho cc khi nh 8x8 picxels tm ra s tha v khng gian mt cch c hiu qu gia cc im nh trong cng mt bc nh. Tuy nhin, trong trng hp c mi tng quan cht ch gia cc im nh trong cc bc nh k tip nhau tc l trong trng hp hai bc nh lin tip c ni dung trng nhau, k thut Inter-frame coding techniques s c dng cng vi vic tin on s d tha v khng gian to thnh k thut tin on b chuyn ng gia cc bc nh (Motion compesated prediction between frames). Trong nhiu s nn MPEG, ngi ta thng kt hp c vic tin on b chuyn ng theo thi gian v php bin i thng tin theo khng gian t hiu qu nn cao (Hybrid DPCM/DCT coding of video).

Hu ht cc s nn MPEG u dng k thut ly mu b xung (Subsampling) v lng t ho (Quantization) trc khi m ho. Ly mu b xung nhm mc ch lm gim kch thc bc nh u vo theo c theo chiu ngang v chiu dc, nh vy s gim s lng cc im nh trc m ho. Cng nn nh rng trong mt s trng hp ngi ta cn ly mu b xung theo thi gian lm gim s lng cc bc nh trong dy nh trc khi m ho. y c xem nh l mt k thut rt c bn nhm loi b s d tha da vo kh nng lu nh ca mt ngi cm th. Thng thng, chng ta c th phn bit s thay i v sng ca nh (changes in Brightness) tt hn so vi s thay i v mu (Chromaticity changes). Do trc ht cc s nn MPEG s tin hnh chia bc nh thnh cc thnh phn Y (Luminance hay brightness plane) v UV (Chrominance hay color planes) tc l mt thnh phn v sng v hai thnh phn v mu. Cc tn hiu video thnh phn ny s c ly mu (samples) v s ho (digitised) to nn cc im nh ri rc theo t l 4 : 2 : 2 v 4 : 2 : 0.

K thut tin on b chuyn ng c s dng nh l mt trong nhng cng c mnh lm gim s d tha v khng gian gia cc bc nh. Khi nim v b chuyn ng l da trn s phn on hng chuyn ng ca cc bc nh tc l cc nh thnh phn trong dy video s c thay th gn ng. K thut tin on b chuyn ng gia cc bc nh c xem nh l bin php hn ch bt cc thng s ca chuyn ng bi vic dng cc vector chuyn ng m t s dch chuyn ca cc im nh. Kt qu tin on tt nht ca mt im nh l da trn s tin on b chuyn ng t mt bc nh m ho c truyn pha trc ca n. C hai thng s, sai s chuyn ng (bin ) v cc vectors chuyn ng (hng chuyn ng) u c truyn ti pha bn nhn. Tuy nhin do c mi quan h tng quan cht ch gia cc im nh v khng gian (trng v khng gian), mt vector chuyn ng c th c dng cho mt khi cc im nh gm cc pixels ln cn nhau (MPEG -1 v MPEG -2 dng cc khi 16 x16 pixels).

Trong MPEG-2, c nhiu phng php tin on s chuyn ng. V d mt khi nh c th c tin on xui t nhng nh c truyn trc n (Forward Predicted), c th on ngc t nhng nh truyn sau n (Backward Predicted) hoc theo c hai chiu (Bidirectionally Predicted). Cc phng php dng tin on cc khi trong cng mt nh cng c th khng ging nhau, chng c th thay i t khi n sang khi kia. Hn na, hai trng (fields) trong cng mt khi cng c th c tin on theo hai cch khc nhau dng cc vector c lp nhau hoc chng c th dng chung mt vector. i vi mi khi nh, b m ho s chn cc phng php tin on thch hp, c gng m bo cht lng nh tt nht khi c gii m trong iu kin yu cu kht khe v s bit. Cc thng s lin quan ti chn phng php tin on cng c truyn ti b gii m cng vi d on sai s nhm khi phc gn chnh xc nh gc.

Trong MPEG, c 3 kiu nh khc nhau c dng m ho cho cc khi nh. Kiu nh Intra (I-pictures) l nh c m ho mt cch c lp m khng cn tham kho ti cc nh khc. Hiu qu nn tn hiu t c do loi b s tha v khng gian m khng c yu t thi gian tham gia vo qu trnh. I-pictures c dng mt cch tun hon to thnh cc im ta cho dng d liu trong qu trnh gii m.

nh Predictive (P-pictures) c th s dng cc nh I hoc P ngay st pha trc n b chuyn ng v chnh n cng c th c dng tham kho cho vic tin on cc nh khc tip theo. Mi khi nh trong P-picture c th hoc c m theo kiu tin on (predicted) hoc c m mt cch c lp (intra-coded). Do s dng c nn theo khng gian v thi gian, hiu qu nn ca P-pictures c tng ln mt cch ng k so vi I-pictures.

nh Bidirectionally-Predictive pictures hay B- Pictures c th s dng cc nh I hoc P pha trc hoc pha sau n cho vic b chuyn ng v do vy cho kt qu nn cao nht. Mi khi trong B-pictures c th c tin on theo chiu ngc, xui, c hai hng hoc c m mt cch c lp. c th tin on ngc t mt bc nh pha sau n, b m ho s tin hnh sp xp li cc bc nh t th t xut hin mt cch t nhin sang mt th t khc ca cc nh trn ng truyn. Do vy t u ra ca b m ho, B-pictures c truyn sau cc nh dng tham kho pha trc v pha sau ca n. iu ny s to ra tr do phi sp xp li thng tin, tr ny ln hay nh l tu thuc vo s cc bc nh B-pictures lin tip nhau c truyn.

S CA B M HO V GII M DNG MPEG-2

M ho MPEG-2
D liu t cc khi nh (macroblocks) cn c m ho s c a n c b tr (Subtractor) v b on chuyn ng (Motion Estimator). B on chuyn ng s so snh cc khi nh mi c a vo ny vi cc khi nh c a vo trc v c lu li nh l cc nh dng tham kho (Reference Picture). Kt qu l b on chuyn ng s tm ra cc khi nh trong nh tham kho gn ging nht vi khi nh mi ny. B on chuyn ng sau s tnh ton vector chuyn ng (Motion Vector), vector ny s c trng cho s dch chuyn theo c hai chiu dc v ngang ca khi nh mi cn m ho so vi nh tham kho. Chng ta lu rng vector chuyn ng c phn gii bng mt na do thc hin qut xen k.

B on chuyn ng cng ng thi gi cc khi nh tham kho ny m chng thng c gi l cc khi tin on (Predicted macroblock) ti b tr tr vi khi nh mi cn m ho (thc hin tr tng im nh tng ng tc l Pixel by pixel). Kt qu l ta s c cc sai s tin on (Error Prediction) hoc tn hiu d, chng s c trng cho s sai khc gia khi nh cn tin on v khi nh thc t cn m ho.

Tn hiu d hay sai s tin on ny s c bin i DCT, cc h s nhn c sau bin i DCT s c lng t ho lm gim s lng cc bits cn truyn. Cc h s ny s c a ti b m ho Huffman, ti y s bits c trng cho cc h s tip tc c lm gim i mt cch ng k. D liu t u ra ca m ho Huffman s c kt hp vi vector chuyn ng v cc thng tin khc (thng tin v I, P, B pictures) gi ti b gii m

i vi trng hp P-pictures, cc h s DCT cng c a n b gii m ni b (nm ngay trong b m ho). Tn hiu d hay sai s tin on c bin i ngc li dng php bin i IDCT v c cng thm vo nh ng trc to nn nh tham kho (nh tin on). V d liu nh trong b m ho c gii m lun nh vo b gii m ni b ngay chnh bn trong b m ho, do ta c th thc hin thay i th t cc bc nh v dng cc phng php tin on nh trnh by trn.

TI LIU THAM KHO


[1]. P.N TUDOR, MPEG-2 Video Compression, Electronics and Communication Engineering Journal, December 1995. [2]. THOMAS SIKORA, MPEG-1 and MPEG-2 Digital Video Coding Standards, McGraw Hill Publishing Company. [3]. THOMAS SIKORA, MPEG-1 and MPEG-2 Digital Video Coding Standards, IEEE Signal Processing Magazine, tobe published. [4]. Introduction to MPEG, http://tektronix.com/Measurement/AppNotes/Mpegfund. [5]. KEVIN JEFFAY, The video Data Type, Coding and Compression Basics, http://www.cs.unc.edu/~jeffay/courses/comp249f99.

También podría gustarte