The flexible 32-point DCT architecture was developed based on the Adaptic CORDIC (ACor) algorithm. Six different versions were implemented based on the number of DCT-point, i.e., 8-point (8p), 16-point (16p), and 32-point (32p), and the number of ACor stages, i.e., 2-Stage (2S) and 3-Stage (3S). The designs were written in Verilog HDL and were built on an FPGA. The advantages of ACor-based DCT are employing CORDIC and Twisted-Adder (TA), and TA was proved to have the minimum adder-delay while providing good accuracy outcomes. Therefore, it can achieve higher speed, low latency, and lower logic resources. The DCT core is open-source and free. This core can be applied in DSP systems, robotics, communication systems, video compression and decompression with H.265 standard. The core was successully implemented on FPGA.
Point | Design | ALUTs | Fmax (MHz) |
---|---|---|---|
8 | 2S-8p-DCT | 431 | 216.88 |
8 | 3S-8p-DCT | 575 | 181.13 |
16 | 2S-16p-DCT | 1,615 | 153.87 |
16 | 3S-16p-DCT | 2,404 | 134.72 |
32 | 2S-32p-DCT | 6,503 | 116.13 |
32 | 3S-32p-DCT | 9,911 | 109.85 |
- Compression and decompession standard: H.265/HEVC
- Format: RGB565, 16-bit color depth
- Resolution: 800x480p
- Speed: 30fps