
中国计算机学会青年科技论坛(YOCSEF)AC委员
中国计算机学会多媒体专委会委员
中国计算机学会人工智能与模式识别专委会委员
中国计算机学会学术工委委员
中国图像图形学会多媒体专委会委员
中国图像图形学会青年工委委员
陕西省信号处理学会理事
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Arcimboldo-like Collage Using Internet Images ACM Transactions on Graphics, 2011, accepted Hua Huang, Lei Zhang, Hongchao Zhang |
|
| Collage is a composite artwork made from assemblage of different material forms. In this work, we present a novel approach for creating a fantastic collage artform, namely Arcimboldo-like collage, which represents an input image with multiple thematically-related cutouts from the filtered Internet images. Due to the massive data of Internet images, competent image cutouts can almost always be discovered to match the segmented components of the input image. The selected cutouts are purposefully arranged such that as a whole assembly, they can represent the input image with disguise in both shape and color; but separately, individual cutout is still recognizable as its own being. Experimental results and user study show that our algorithm can effectively produce the entertaining Arcimboldolike collages. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Web-Image Driven Best Views of 3D Shapes The Visual Computer. 2011, Accepted Hong Liu, Lei Zhang, Hua Huang |
|
| The rapid advance of the Internet provides available huge database of web images. In this paper, we introduce a novel approach for automatically computing the best views of 3D shapes based on their web images. Best view selection is generally an intuitive task of getting the most information of a 3D shape. The novelty of our approach is to directly explore human perception on observing 3D shapes from the relevant web images. Those images are captured from biased views of different people, thus sufficiently reflecting view choice when observing the 3D shapes. By collecting web images possibly captured from the similar views, the best view is selected as the one possessing the most web images. We experiment our method with the shapes in Princeton Shape Benchmark (PSB), as well make comparisons with traditional geometric descriptor based approaches. The results demonstrate that our method is not only robust but also able to produce more canonical views in accordance with human perception. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
What’re in a scene? - Towards the Computational Best View of 3D Scene Advanced Science Letters, 2011, Accepted Lei Zhang, Hong Liu, Hua Huang |
|
| A 3D scene is usually composed of groups of multiple objects with different materials. From the best viewpoint, people should easily observe what are in the scene. In this paper, we present a method to automatically compute the best view for a 3D scene. By clustering on geometric features of different orders, a novel scene view descriptor is proposed to enable faithful measurement on the concurrent importance of scene objects. Then the view cost function based on the descriptor is optimized to select the best view from a set of candidate views. From the best view computed by our method, people can quickly get a complete picture on the target scene. We experiment our method with a series of 3D scene models from Google 3D Warehouse, and make comparisons with previous view selection methods as well as some user studies on the computed best views. The results demonstrate that our method is efficient and robust towards the computation of the best views of 3D scenes. | |
_____________________________________________________________________________________________________ |
|
|
Fast feature-based mode decision for 4×4 intra prediction in H.264/AVC Science in China Series F: Information Sciences. 2011, Accepted Wang Ping, Huang Hua, Tan Zheng |
|
_____________________________________________________________________________________________________ |
|
![]() |
|
|
RepSnapping: efficient image cutout for repeated scene elements Computer Graphics Forum. 2011, Accepted Hua Huang, Lei Zhang, Hong-Chao Zhang |
|
| Repeated scene elements are copious and ubiquitous in natural images. Cutout of those repeated elements usually involves tedious and laborious user interaction by previous image segmentation methods. In this paper, we present RepSnapping, a novel method oriented to cutout of repeated scene elements with much less user interaction. By exploring inherent similarity between repeated elements, a new optimization model is introduced to thread correlated elements in the segmentation procedure. The model proposed here enables efficient solution using maxflow/ min cut on an extended graph. Experiments indicate that RepSnapping facilitates cutout of repeated elements better than the state-of-the-art interactive image segmentation and repetition detection methods. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Fast Facial Image Super-resolution via Local Linear Transformations for Resource-limited Applications
IEEE Transactions on Circuits and Systems for Video Technology, 2011. Onlie Hua Huang, Ning Wu |
|
| Most of popular learning-based super-resolution (SR) approaches suffer from complicated learning structures and highly intensive computation, especially in resource-limited applications. We propose a novel frontal facial image SR approach by using multiple local linear transformations to approximate the non-linear mapping between low-resolution (LR) and highresolution (HR) images in the pixel domain. We adopt Procrustes analysis to obtain orthogonal matrices representing the learned linear transformations, which can not only well capture appearance variations in facial patches but also greatly simplify the transformation computation to matrices manipulation. An HR image can be directly reconstructed from a single LR image without need of the large training data, thus avoiding the use of a large redundant LR and HR patch database. Experimental results show that our approach is computationally fast as well the SR quality compares favorably with the state-of-the-art approaches from both subjective and objective evaluations. Besides, our approach is insensitive to the size of training data and robust to a wide range of facial variations like occlusions. More importantly, the proposed method is also much more effective than other comparative methods to reconstruct real world images captured from the Internet and webcams. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Internet image voting for best view selection of 3D shapes Computer Graphics International Conference. 2011 Hong Liu, Lei Zhang, Hua Huang |
|
| Based on the huge image database on the In- ternet, we introduce a new approach to automatically select the best view for a 3D shape by Internet image voting (IIV). The problem of selecting the best view is mainly considered to be a task of getting the most information of a 3D shape, which is much intuitive and deeply related to the nature of human perception. The novelty of our IIV approach is to encourage common hu- man perception to directly determine the best view. To collect the human perception on a given 3D shape, our approach rst searches relevant images on the Internet. By carefully ltering out some cluttered images, we ob- tain a database where the images are feasible to analyze as well possibly contains candidate views captured from dierent people. Then, for each sampled view of the 3D shape, images tally votes for the best matching view by considering both shape and saliency similarity. Finally, the candidate view with the most votes is selected as the best view. We experiment our method with the shapes in Princeton Shape Benchmark, and also make compar- isons with traditional geometric descriptor based ap- proaches. The results demonstrate that our method is not only robust but also able to produce better view in accordance with human perception. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Painterly rendering with content-dependent natural paint strokes
The Visual Computer. 2011, 27(9):861-871 Hua Huang, TianNan Fu |
|
| We present a new painterly rendering method that simulates artists’ content-dependent painting process and the natural variation of hand-painted strokes. First, a new stroke layout strategy is proposed to enhance the contrast between large and small paint strokes, which is an important characteristic of hand-painted paintings. Specifically, the input image is partitioned into nonuniform grids according to its importance map, and determined by the grid size, an individually constructed paint stroke is applied in each grid. Second, an anisotropic digital brush is designed to simulate a real paint brush. In particular, each bristle of the digital brush has an individual color, so that strokes rendered by the new brush can have multiple colors and naturally varied textures. Finally, we present a novel method to add lighting effects to the canvas. This lighting imitation method is robust and very easy to implement, and it can significantly improve the quality of rendering. Comparing with traditional painterly rendering approaches, the new method simulates more closely the real painting procedure, and our experimental results show that it can produce vivid paintings with fewer artifacts. | |
_____________________________________________________________________________________________________ |
|
|
A fast two-step block type decision algorithm for intra prediction in H.264/AVC high profile
Multimedia Tools and Applications. 2011, Online Ping Wang, Hua Huang, Zheng Tan |
|
| Intra coding in H.264/AVC can significantly improve the compression efficiency but at the cost of high computational complexity. To reduce the complexity, this paper presents an efficient block type decision algorithm for intra prediction. In H.264/AVC high profile, three kinds of block types are supported. This algorithm determines the optimal block type by two steps. The first step is based on the fact that the block type of intra prediction is highly dependent on the smoothness of macroblock. An edge-based feature is introduced to characterize the smoothness, by comparing it with two thresholds impossible block type is firstly filtered out. Then the second step is based on the correlation of block type chosen for different chroma modes. Experimental results show that the proposed fast algorithm can achieve 72.8% time saving on average for encoding the all intra-frame sequence with average 0.75% bit rate increase and 0.05dB PSNR degradation when comparing with the reference software. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
EXCOL: an extract-and-complete approach to cartoon animation reusing
IEEE Transactions on Visualization and Computer Graphics. 2011, Online Lei Zhang, Hua Huang, Hong-Bo Fu |
|
| We introduce the EXCOL method (EXtract-and-COmplete Layering) — a novel cartoon animation processing technique to convert a traditional animated cartoon video into multiple semantically meaningful layers. Our technique is inspired by vision-based layering techniques but focuses on shape cues in both the extraction and completion steps to reflect the unique characteristics of cartoon animation. For layer extraction, we define a novel similarity measure incorporating both shape and color of automatically segmented regions within individual frames and propagate a small set of user-specified layer labels among similar regions across frames. By clustering regions with the same labels, each frame is appropriately partitioned into different layers, with each layer containing semantically meaningful content. Then a warping-based approach is used to fill missing parts caused by occlusion within the extracted layers to achieve a complete representation. EXCOL provides a flexible way to effectively reuse traditional cartoon animations with only a small amount of user interaction. It is demonstrated that our EXCOL method is effective and robust, and the layered representation benefits a variety of applications in cartoon animation processing. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Super-Resolution Method for Face Recognition Using Nonlinear Mappings on Coherent Features
IEEE Transaction on Neural Networks. 2011, 22(1): 121-130 Hua Huang, Huiting He |
|
| Low-resolution (LR) of face images significantly decreases the performance of face recognition. To address this problem, we present a super-resolution method that uses nonlinear mappings to infer coherent features that favor higher recognition of the nearest neighbor (NN) classifiers for recognition of single LR face image. Canonical correlation analysis is applied to establish the coherent subspaces between the principal component analysis (PCA) based features of high-resolution (HR) and LR face images. Then, a nonlinear mapping between HR/LR features can be built by radial basis functions (RBFs) with lower regression errors in the coherent feature space than in the PCA feature space. Thus, we can compute super-resolved coherent features corresponding to an input LR image according to the trained RBF model efficiently and accurately. And, face identity can be obtained by feeding these super-resolved features to a simple NN classifier. Extensive experiments on the Facial Recognition Technology, University of Manchester Institute of Science and Technology, and Olivetti Research Laboratory databases show that the proposed method outperforms the state-of-the-art face recognition algorithms for single LR image in terms of both recognition rate and robustness to facial variations of pose and expression. | |
| _____________________________________________________________________________________________________ |
_____________________________________________________________________________________________________ |
|
|
Manifold Learning for Visualizing and Analyzing High- dimensional Data IEEE Intelligent Systems. 2010, 25(4): 54-61 Junping Zhang, Hua Huang, Jue Wang |
|
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Super-resolution of Human Face Image Using Canonical Correlation analysis
Pattern Recognition. 2010, 43(7): 2532-2543 Hua Huang, Huiting He, Xin Fan, Junping Zhang |
|
| Super-resolution reconstruction of face image is the problem of reconstructing a high resolution face image from one or more low resolution face images. Assuming that high and low resolution images share similar intrinsic geometries, various recent super-resolution methods reconstruct high resolution images based on a weights determined from nearest neighbors in the local embedding of low resolution images. These methods suffer disadvantages from the finite number of samples and the nature of manifold learning techniques, and hence yield unrealistic reconstructed images. To address the problem, we apply canonical correlation analysis (CCA), which maximizes the correlation between the local neighbor relationships of high and low resolution images. We use it separately for reconstruction of global face appearance, and facial details. Experiments using a collection of frontal human faces show that the proposed algorithm improves reconstruction quality over existing state-of-the-art super-resolution algorithms, both visually, and using a quantitative peak signal-to-noise ratio assessment. |
|
_____________________________________________________________________________________________________ |
|
![]() |
|
|
A Simple Approach to Multiview Face Hallucination IEEE Signal Processing Letter. 2010, 17(6): 579-582 Xiang Ma, Hua Huang, Shaopeng Wang, and Chun Qi |
|
| Most face hallucination methods are usually limited to frontal face with small pose variations. This letter presents a simple and efficient multiview face hallucination (MFH) method to generate high-resolution (HR) multiview faces from a single given low-resolution (LR) one. The problem is addressed in two steps. A simple face transformation method is proposed by defining a constrained least square problem for LR multiview face transformation and a position-patch based face hallucination method is extended to incorporate HR multiview face details. Experimental results show that our approach has some advantages over existing MFH methods. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Example-Based Painting Guided by Color Features
The Visual Computer. 2010, 26(6-8): 933 - 942 Hua Huang, Yu Zang, Chen-Feng Li |
|
| In this paper, by analyzing and learning the color features of the reference painting with a novel set of measures, an example-based approach is developed to transfer some key color features from the template to the source image. First, color features of a given template painting is analyzed in terms of hue distribution and the overall color tone. These features are then extracted and learned by the algorithm through an optimization scheme. Next, to ensure the spatial coherence of the final result, a segmentation based post processing is performed. Finally, a new color blending model, which avoids the dependence of edge detection and adjustment of inconvenient tune parameters, is developed to provide a flexible control for the accuracy of painting. Experimental results show that the new example-based painting system can produce paintings with specific color features of the template, and it can also be applied to changing color themes of art pieces, designing color styles of paintings/real images, and specific color harmonization. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Example-based contrast enhancement by gradient mapping
The Visual Computer. 2010, 26(6-8): 731-738 Hua Huang, Xuezhong Xiao |
|
| Contrast enhancement is a very important problem in image processing. The key issue is how to assign correct enhancement levels for the local regions in an image, which makes previous methods incur much artifacts, e.g., over-enhancement, halo. In this paper, an example-based contrast enhancement algorithm is proposed, which works in the gradient domain. We utilize GMM model to describe the gradient distribution of an image. Then a GMM-based gradient mapping method is proposed to transfer the gradient of a reference image to the source image. The enhanced image is obtained by solving a Poisson equation defined by the altered gradient. Experimental results show the effectiveness and robustness of our method. |
|
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Mesh Reconstruction by Meshless Denoising and Parameterization
Computers & Graphics. 2010, 34(3): 198-208 Lei Zhang, Ligang Liu, Craig Gotsman, Hua Huang |
|
| We present a new approach to simultaneously denoise and parameterize unorganized point cloud data. This is achieved by minimizing an appropriate energy function defined on the denoised point cloud and its parameterization. An iterative algorithm to minimize the energy is described. The key ingredient of our approach is an ``as-rigid-as-possible" meshless parameterization to map a point cloud with disk topology to the plane without building the connectivity of the point cloud. Then 2D triangulation method can be applied to the planar parameterization to provide triangle connectivity for the 2D points, which can be transferred back to the 3D point cloud to form a triangle mesh surface. We also show how to generalize the approach to meshes with closed topology, of any genus. Experimental results have shown that our approach can effectively denoise the point cloud and our meshless parameterization can preserve local distances in the point cloud, resulting in a more regular 3D triangle mesh, compared to other methods. | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
RBF Network-Based Temporal Color Morphing
Journal of computer animation and virtual worlds. 2010, 21(3-4): 289-296, SCI UT ISI: 000280135400016 Xuezhong Xiao, Hua Huang, Lizhuang Ma |
|
| A method of RBF network-based temporal color morphing is proposed to simulate the natural phenomena characterized by temporal color alteration, e.g., turning green of foliage, resurgence of leaves. Such phenomena usually span a long time and it is very difficult to capture their whole process. Our system accepts a source image sequence and a reference image as input. The source sequence contains the desired scene except for color alteration, and the reference image has the color style which the source sequence is expected to advance into. First, an RBF network is employed to model the mapping between the colors of the source sequence and the reference image. Then, a simple interpolation algorithm is applied to render the resulting sequence. The effectiveness of the new method is verified by experiments. | |
| _____________________________________________________________________________________________________ |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
实时图像素描风格化
计算机学报, 2009, 32(10): 2023-2029 黄华, 程威 |
|
| 提出了一种实时的图像素描风格化算法。根据素描以及人眼视觉的特点,利用人眼对颜色差异的敏感程 度来勾勒图像的轮廓,并依据图像光感强度产生的重要性图来控制渲染素描风格画的阴影纹理,使得生成的素描 风格画具有良好的艺术效果。与已有方法相比,该文算法不需要人工交互,生成的素描视觉效果更好,在速度上能 达到实时。 | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Edge-Aware Level Set Diffusion and Bilateral Filtering Reconstruction for Image Magnification
Journal of Computer Science and Technology, 2009 Hua Huang, Yu Zang, Paul L. Rosin and Chun Qi |
|
| In this paper we propose an image magnification reconstruction method. In recent years many interpolation algorithms have been proposed for image magnification, but all of them have defects to some degree, such as jaggies and blurring. To solve these problems, we propose applying post-processing which consists of edge-aware level set diffusion and bilateral filtering. After the initial interpolation, the contours of the image are identified. Next, edge-aware level set diffusion is applied to these significant contours to remove the jaggies, followed by bilateral filtering at the same locations to reduce the blurring created by the initial interpolation and level set diffusion. These processes produce sharp contours without jaggies and preserve the details of the image. Results show that the overall RMS error of our method barely increases while the contour smoothness and sharpness are substantially improved. | |
_____________________________________________________________________________________________________ |
|
|
基于相似度加权的自适应HD算法
自动化学报, 2009, 35(7): 882-887 EI AN: 20093212244565 黄华, 颜恺, 齐春 |
|
| Hausdor® 距离(Hausdor® distance, HD) 是一种点集与点集之间的距离测度, 常用于目标物体的匹配、跟踪和识别等。 本文在分析经典HD 及改进算法的基础上, 提出了一种基于相似度加权的自适应HD (Adaptive Hausdar® distance,AHD) 算法。 AHD 算法利用不同点到点集的最小距离的个数作为匹配相似度的测量, 并舍弃对判断匹配几乎没有作用的较大的点到点集的最小距离值; 同时根据点到点集的最小距离自适应选择权值, 从而得到一种基于相似度测量加权系数; 通过利用部分点到点集的最小距离和基于相似度的加权平均, 既增强了算法的鲁棒性, 又尽可能地保证了算法的精度。 实验结果显示,AHD 算法在匹配准确性、抵抗噪声和遮挡干扰等方面性能良好。 | |
_____________________________________________________________________________________________________ |
|
![]() |
|
|
Real-Time Content-Aware Image Resizing
Science in China Series F: Information Sciences, 2009, 52(2):172-182 SCI IDS: 397ZZ Huang Hua, Fu TianNan, Rosin Paul L. & Qi Chun |
|
| Content-aware image resizing is a kind of new and effective approach for image resizing, which preserves image content well and does not cause obvious distortion when changing the aspect ratio of images. Recently, a seam based approach for content-aware image resizing was proposed by Avidan and Shamir. Their results are impressive, but because the method uses dynamic programming many times, it is slow. In this paper, we present a more efficient algorithm for seam based content-aware image resizing, which searches seams through establishing the matching relation between adjacent rows or columns. We give a linear algorithm to find the optimal matches within a weighted bipartite graph composed of the pixels in adjacent rows or columns. Therefore, our method is fast (e.g. our method needs only about 100 ms to reduce a 768×1024 image’s width to 1/3 while Avidan and Shamir’s method needs 12 s). This supports immediate image resizing whereas Avidan and Shamir’s method requires a more costly pre-processing step to enable subsequent real-time processing. A fast method such as the one proposed will be also needed for future real-time video resizing applications. | |
| _____________________________________________________________________________________________________ |
_____________________________________________________________________________________________________ |
|
|
基于粒子滤波的人脸图像超分辨率重建方法
软件学报, 2006, 17(12): 2529-2536 EI AN:070310372717 黄华, 樊鑫, 齐春, 朱世华 |
|
| 将人脸图像超分辨率重建描述为人脸混合模型的纹理和位置参数的贝叶斯概率估计问题,将超分辨率重建的图像配准和像素融合这两个过程置于统一的概率估计框架下,并利用基于粒子滤波的参数估计算法,同时估计纹理和位置参数,从而实现人脸图像的超分辨率重建。包含灰度和位置两种先验信息的人脸混合模型,同时用于超分辨率重建的两个过程中,提高了图像配准精度和重建算法的性能,避免了通常方法在获得准确鲁棒的运动场估计时需要清晰的高分辨图像,而获得清晰的高分辨图像时又需要准确鲁棒运动场估计的困境。正面人脸合成序列图像实验结果表明,该方法获得的重建结果较为理想。 | |
_____________________________________________________________________________________________________ |
|
|
A Hybrid Parallel Projection Approach to Object-based Image Restoration
Pattern Recognition Letters 2006, 27(10): 1045-1053 SCI IDS Number: 049LP, EI AN: 06199867532 Xin Fan, Hua Huang, Dequn Liang, Chun Qi |
|
| Approaches analyzing local characteristics of an image prevail in image restoration. However, they are less effective in cases of restoring images degraded by large size point spread functions (PSFs) and heavy noise. The recently proposed learning based approaches perform well on recovering details from images degraded by large size PSFs, yet involves complicated implementation process and high computational expense. In this paper, we propose a hybrid approach to object-based image restoration. This method incorporates common characteristics of images from a class of objects into image restoration. These characteristics are represented as deterministic sets built on principal component analysis (PCA) models. The sets are combined with the observation model represented via a Bayesian approach to constrain the solution. A parallel projection algorithm is also proposed to find the solution that satisfies all constraints. Experiments performed on frontal face images using the proposed approach show superior performance over those based on local analysis in the cases involving large size PSF and heavy noise degradation. Compared with learning based approaches, the proposed approach can be implemented with ease and the solution can be found with less complexity. | |
_____________________________________________________________________________________________________ |
|
|
基于识别的凸集投影人脸图像超分辨率重建
计算机研究与发展, 2005, 42(10):1718-1725 EI AN:05469481445 黄华, 樊鑫, 齐春, 朱世华 |
|
| 人脸图像的超分辨率重建在公安、视频监控等领域有重要应用价值。基于识别的思想,对人脸灰度图像进行统计分析,得到有关人脸灰度整体特征的先验知识,将其描述为属性集合,从而利用凸集投影算法进行超分辨率图像重建。实验结果表明,重建质量较为理想,与通常的超分辨率凸集投影重建方法相比,抑制噪声的能力有显著提高,重建质量改善明显,收敛速度加快,且易于计算和实现。 | |
_____________________________________________________________________________________________________ |
|
|
文本数字水印
中文信息学报, 2001, 15(5):52-57 黄华, 齐春, 李俊 |
|
| 目前数字水印技术的研究和文献主要集中在静止图像和视频的保护等方面,文本数字水印研究的很少.国内甚至还未见到文本数字水印的相关文献。而实际上.一些文本文档比图像、视频等更需要得到保护;数字文本的保护对互联网时代的政府工作和电子商务等也具有重要意义。本文主要介绍文本数字水印技术的基本思想和目前的研究状况,首先介绍了文本数字水印的嵌入与检测方法.然后分析了用于中文的文本数字水印的研究方向以及可能的应用前景。 | |
| _____________________________________________________________________________________________________ |