SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Kahl Fredrik 1972) "

Sökning: WFRF:(Kahl Fredrik 1972)

  • Resultat 1-50 av 63
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Eriksson, Anders, 1972, et al. (författare)
  • Rotation Averaging and Strong Duality
  • 2018
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781538664209 ; , s. 127-135
  • Konferensbidrag (refereegranskat)abstract
    • In this paper we explore the role of duality principles within the problem of rotation averaging, a fundamental task in a wide range of computer vision applications. In its conventional form, rotation averaging is stated as a minimization over multiple rotation constraints. As these constraints are non-convex, this problem is generally considered challenging to solve globally. We show how to circumvent this difficulty through the use of Lagrangian duality. While such an approach is well-known it is normally not guaranteed to provide a tight relaxation. Based on spectral graph theory, we analytically prove that in many cases there is no duality gap unless the noise levels are severe. This allows us to obtain certifiably global solutions to a class of important non-convex problems in polynomial time. We also propose an efficient, scalable algorithm that out-performs general purpose numerical solvers and is able to handle the large problem instances commonly occurring in structure from motion settings. The potential of this proposed method is demonstrated on a number of different problems, consisting of both synthetic and real-world data.
  •  
2.
  • Alipoor, Mohammad, 1983, et al. (författare)
  • Optimal Experiment Design for Mono-Exponential Model Fitting: Application to Apparent Diffusion Coefficient Imaging
  • 2015
  • Ingår i: BioMed Research International. - : Hindawi Limited. - 2314-6133 .- 2314-6141. ; 2015
  • Tidskriftsartikel (refereegranskat)abstract
    • The mono-exponential model is widely used in quantitative biomedical imaging. Notable applications include apparent diffusion coefficient (ADC) imaging and pharmacokinetics.The application of ADC imaging to the detection of malignant tissue has in turn prompted several studies concerning optimal experiment design for mono-exponential model fitting. In this paper, we propose a new experiment design method that is based on minimizing the determinant of the covariance matrix of the estimated parameters (?-optimal design). In contrast to previous methods, ?-optimal design is independent of the imaged quantities. Applying this method to ADC imaging, we demonstrate its steady performance for the whole range of input variables (imaged parameters, number of measurements, range of ?-values). Using Monte Carlo simulations we show that the ?-optimal design outperforms existing experiment design methods in terms of accuracy and precision of the estimated parameters.
  •  
3.
  • Alipoor, Mohammad, 1983, et al. (författare)
  • Optimal Gradient Encoding Schemes for Diffusion Tensor and Kurtosis Imaging
  • 2016
  • Ingår i: IEEE transactions on Computational Imaging. - 2333-9403. ; 2:3, s. 375-391
  • Tidskriftsartikel (refereegranskat)abstract
    • Diffusion-derived parameters find application in characterizing pathological and developmental changes in living tissues. Robust estimation of these parameters is important because they are used for medical diagnosis. An optimal gradient encoding scheme (GES) is one that minimizes the variance of the estimated diffusion parameters. This paper proposes a method for optimal GES design for two diffusion models: high-order diffusion tensor (HODT) imaging and diffusion kurtosis imaging (DKI). In both cases, the optimal GES design problem is formulated as a D-optimal (minimum determinant) experiment design problem. Then, using convex relaxation, it is reformulated as a semidefinite programming problem. Solving these problems we show that: 1) there exists a D-optimal solution for DKI that is simultaneously D-optimal for second- and fourth-order diffusion tensor imaging (DTI); 2) the traditionally used icosahedral scheme is approximately D-optimal for DTI and DKI; 3) the proposed D-optimal design is rotation invariant; 4) the proposed method can be used to compute the optimal design ($b$ -values and directions) for an arbitrary number of measurements and shells; and 5) using the proposed method one can obtain uniform distribution of gradient encoding directions for a typical number of measurements. Importantly, these theoretical findings provide the first mathematical proof of the optimality of uniformly distributed GESs for DKI and HODT imaging. The utility of the proposed method is further supported by the evaluation results and comparisons with with existing methods.
  •  
4.
  • Alvén, Jennifer, 1989, et al. (författare)
  • A Deep Learning Approach to MR-less Spatial Normalization for Tau PET Images
  • 2019
  • Ingår i: Medical Image Computing and Computer Assisted Intervention : MICCAI 2019 - 22nd International Conference, Proceedings - MICCAI 2019 - 22nd International Conference, Proceedings. - Cham : Springer International Publishing. - 0302-9743 .- 1611-3349. - 9783030322458 - 9783030322441 ; 11765 LNCS, s. 355-363
  • Konferensbidrag (refereegranskat)abstract
    • The procedure of aligning a positron emission tomography (PET) image with a common coordinate system, spatial normalization, typically demands a corresponding structural magnetic resonance (MR) image. However, MR imaging is not always available or feasible for the subject, which calls for enabling spatial normalization without MR, MR-less spatial normalization. In this work, we propose a template-free approach to MR-less spatial normalization for [18F]flortaucipir tau PET images. We use a deep neural network that estimates an aligning transformation from the PET input image, and outputs the spatially normalized image as well as the parameterized transformation. In order to do so, the proposed network iteratively estimates a set of rigid and affine transformations by means of convolutional neural network regressors as well as spatial transformer layers. The network is trained and validated on 199 tau PET volumes with corresponding ground truth transformations, and tested on two different datasets. The proposed method shows competitive performance in terms of registration accuracy as well as speed, and compares favourably to previously published results.
  •  
5.
  • Alvén, Jennifer, 1989, et al. (författare)
  • Shape-aware multi-atlas segmentation
  • 2016
  • Ingår i: Proceedings - International Conference on Pattern Recognition. - 1051-4651. ; 0, s. 1101-1106
  • Konferensbidrag (refereegranskat)abstract
    • Despite of having no explicit shape model, multi-atlas approaches to image segmentation have proved to be a top-performer for several diverse datasets and imaging modalities. In this paper, we show how one can directly incorporate shape regularization into the multi-atlas framework. Unlike traditional methods, our proposed approach does not rely on label fusion on the voxel level. Instead, each registered atlas is viewed as an estimate of the position of a shape model. We evaluate and compare our method on two public benchmarks: (i) the VISCERAL Grand Challenge on multi-organ segmentation of whole-body CT images and (ii) the Hammers brain atlas of MR images for segmenting the hippocampus and the amygdala. For this wide spectrum of both easy and hard segmentation tasks, our experimental quantitative results are on par or better than state-of-the-art. More importantly, we obtain qualitatively better segmentation boundaries, for instance, preserving fine structures.
  •  
6.
  • Alvén, Jennifer, 1989, et al. (författare)
  • Überatlas: Fast and robust registration for multi-atlas segmentation
  • 2016
  • Ingår i: Pattern Recognition Letters. - : Elsevier BV. - 0167-8655. ; 80, s. 249-255
  • Tidskriftsartikel (refereegranskat)abstract
    • Multi-atlas segmentation has become a frequently used tool for medical image segmentation due to its outstanding performance. A computational bottleneck is that all atlas images need to be registered to a new target image. In this paper, we propose an intermediate representation of the whole atlas set – an überatlas – that can be used to speed up the registration process. The representation consists of feature points that are similar and detected consistently throughout the atlas set. A novel feature-based registration method is presented which uses the überatlas to simultaneously and robustly find correspondences and affine transformations to all atlas images. The method is evaluated on 20 CT images of the heart and 30 MR images of the brain with corresponding ground truth. Our approach succeeds in producing better and more robust segmentation results compared to three baseline methods, two intensity-based and one feature-based, and significantly reduces the running times.
  •  
7.
  • Alvén, Jennifer, 1989, et al. (författare)
  • Überatlas: Robust Speed-Up of Feature-Based Registration and Multi-Atlas Segmentation
  • 2015
  • Ingår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. - 9783319196640 ; 9127, s. 92-102
  • Konferensbidrag (refereegranskat)abstract
    • Registration is a key component in multi-atlas approaches to medical image segmentation. Current state of the art uses intensitybased registration methods, but such methods tend to be slow, which sets practical limitations on the size of the atlas set. In this paper, a novel feature-based registration method for affine registration is presented. The algorithm constructs an abstract representation of the entire atlas set, an uberatlas, through clustering of features that are similar and detected consistently through the atlas set. This is done offline. At runtime only the feature clusters are matched to the target image, simultaneously yielding robust correspondences to all atlases in the atlas set from which the affine transformations can be estimated efficiently. The method is evaluated on 20 CT images of the heart and 30 MR images of the brain with corresponding gold standards. Our approach succeeds in producing better and more robust segmentation results compared to two baseline methods, one intensity-based and one feature-based, and significantly reduces the running times.
  •  
8.
  • Arnab, Anurag, et al. (författare)
  • Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction
  • 2018
  • Ingår i: IEEE Signal Processing Magazine. - 1558-0792 .- 1053-5888. ; 35:1, s. 37-52
  • Tidskriftsartikel (refereegranskat)abstract
    • Semantic segmentation is the task of labeling every pixel in an image with a predefined object category. It has numerous applications in scenarios where the detailed understanding of an image is required, such as in autonomous vehicles and medical diagnosis. This problem has traditionally been solved with probabilistic models known as conditional random fields (CRFs) due to their ability to model the relationships between the pixels being predicted. However, deep neural networks (DNNs) recently have been shown to excel at a wide range of computer vision problems due to their ability to automatically learn rich feature representations from data, as opposed to traditional handcrafted features. The idea of combining CRFs and DNNs have achieved state-of-the-art results in a number of domains. We review the literature on combining the modeling power of CRFs with the representation-learning ability of DNNs, ranging from early work that combines these two techniques as independent stages of a common pipeline to recent approaches that embed inference of probabilistic models directly in the neural network itself. Finally, we summarize future research directions.
  •  
9.
  • Arvidsson, Jonathan, et al. (författare)
  • Image Fusion of Reconstructed Digital Tomosynthesis Volumes From a Frontal and a Lateral Acquisition
  • 2016
  • Ingår i: Radiation protection dosimetry. - : Oxford University Press (OUP). - 1742-3406 .- 0144-8420. ; 169:1-4, s. 410-415
  • Tidskriftsartikel (refereegranskat)abstract
    • Digital tomosynthesis (DTS) has been used in chest imaging as a low radiation dose alternative to computed tomography (CT). Traditional DTS shows limitations in the spatial resolution in the out-of-plane dimension. As a first indication of whether a dual-plane dual-view (DPDV) DTS data acquisition can yield a fair resolution in all three spatial dimensions, a manual registration between a frontal and a lateral image volume was performed. An anthropomorphic chest phantom was scanned frontally and laterally using a linear DTS acquisition, at 120 kVp. The reconstructed image volumes were resampled and manually co-registered. Expert radiologist delineations of the mediastinal soft tissues enabled calculation of similarity metrics in regard to delineations in a reference CT volume. The fused volume produced the highest total overlap, implying that the fused volume was a more isotropic 3D representation of the examined object than the traditional chest DTS volumes.
  •  
10.
  •  
11.
  • Brynte, Lucas, 1990, et al. (författare)
  • On the Tightness of Semidefinite Relaxations for Rotation Estimation
  • 2022
  • Ingår i: Journal of Mathematical Imaging and Vision. - : Springer Science and Business Media LLC. - 1573-7683 .- 0924-9907. ; 64:1, s. 57-67
  • Tidskriftsartikel (refereegranskat)abstract
    • Why is it that semidefinite relaxations have been so successful in numerous applications in computer vision and robotics for solving non-convex optimization problems involving rotations? In studying the empirical performance, we note that there are few failure cases reported in the literature, in particular for estimation problems with a single rotation, motivating us to gain further theoretical understanding. A general framework based on tools from algebraic geometry is introduced for analyzing the power of semidefinite relaxations of problems with quadratic objective functions and rotational constraints. Applications include registration, hand–eye calibration, and rotation averaging. We characterize the extreme points and show that there exist failure cases for which the relaxation is not tight, even in the case of a single rotation. We also show that some problem classes are always tight given an appropriate parametrization. Our theoretical findings are accompanied with numerical simulations, providing further evidence and understanding of the results.
  •  
12.
  • Brynte, Lucas, 1990, et al. (författare)
  • Pose Proposal Critic: Robust Pose Refinement by Learning Reprojection Errors
  • 2020
  • Ingår i: 31st British Machine Vision Conference, BMVC 2020.
  • Konferensbidrag (refereegranskat)abstract
    • In recent years, considerable progress has been made for the task of rigid object pose estimation from a single RGB-image, but achieving robustness to partial occlusions remains a challenging problem. Pose refinement via rendering has shown promise in order to achieve improved results, in particular, when data is scarce. In this paper we focus our attention on pose refinement, and show how to push the state-of-the-art further in the case of partial occlusions. The proposed pose refinement method leverages on a simplified learning task, where a CNN is trained to estimate the reprojection error between an observed and a rendered image. We experiment by training on purely synthetic data as well as a mixture of synthetic and real data. Current state-of-the-art results are outperformed for two out of three metrics on the Occlusion LINEMOD benchmark, while performing on-par for the final metric.
  •  
13.
  • Brynte, Lucas, 1990, et al. (författare)
  • Rigidity preserving image transformations and equivariance in perspective
  • 2023
  • Ingår i: Image analysis. - Cham : Springer Nature. - 9783031314384 - 9783031314377 ; 13886 LNCS, s. 59-76
  • Konferensbidrag (refereegranskat)abstract
    • We characterize the class of image plane transformations which realize rigid camera motions and call these transformations ‘rigidity preserving’. It turns out that the only rigidity preserving image transformations are homographies corresponding to rotating the camera. In particular, 2D translations of pinhole images are not rigidity preserving. Hence, when using CNNs for 3D inference tasks, it can be beneficial to modify the inductive bias from equivariance w.r.t. translations to equivariance w.r.t. rotational homographies. We investigate how equivariance with respect to rotational homographies can be approximated in CNNs, and test our ideas on 6D object pose estimation. Experimentally, we improve on a competitive baseline.
  •  
14.
  • Bylow, Erik, et al. (författare)
  • Minimizing the maximal rank
  • 2016
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781467388511 ; 2016-January, s. 5887-5895
  • Konferensbidrag (refereegranskat)abstract
    • In computer vision, many problems can be formulated as finding a low rank approximation of a given matrix. Ideally, if all elements of the measurement matrix are available, this is easily solved in the L2-norm using factorization. However, in practice this is rarely the case. Lately, this problem has been addressed using different approaches, one is to replace the rank term by the convex nuclear norm, another is to derive the convex envelope of the rank term plus a data term. In the latter case, matrices are divided into sub-matrices and the envelope is computed for each subblock individually. In this paper a new convex envelope is derived which takes all sub-matrices into account simultaneously. This leads to a simpler formulation, using only one parameter to control the trade-of between rank and data fit, for applications where one seeks low rank approximations of multiple matrices with the same rank. We show in this paper how our general framework can be used for manifold denoising of several images at once, as well as just denoising one image. Experimental comparisons show that our method achieves results similar to state-of-the-art approaches while being applicable for other problems such as linear shape model estimation.
  •  
15.
  • Bylow, Erik, et al. (författare)
  • Robust Camera Tracking by Combining Color and Depth Measurements
  • 2014
  • Ingår i: International Conference on Pattern Recognition. - 1051-4651.
  • Konferensbidrag (refereegranskat)abstract
    • One of the major research areas in computer vision is scene reconstruction from image streams. The advent of RGB-D cameras, such as the Microsoft Kinect, has lead to new possibilities for performing accurate and dense 3D reconstruction. There are already well-working algorithms to acquire 3D models from depth sensors, both for large and small scale scenes. However, these methods often break down when the scene geometry is not so informative, for example, in the case of planar surfaces. Similarly, standard image-based methods fail for texture-less scenes. We combine both color and depth measurements from an RGB-D sensor to simultaneously reconstruct both the camera motion and the scene geometry in a robust manner. Experiments on real data show that we can accurately reconstruct large-scale 3D scenes despite many planar surfaces.
  •  
16.
  • Bylow, Erik, et al. (författare)
  • Robust online 3D reconstruction combining a depth sensor and sparse feature points
  • 2016
  • Ingår i: Proceedings - International Conference on Pattern Recognition. - 1051-4651. ; 0, s. 3709-3714, s. 3709-3714
  • Konferensbidrag (refereegranskat)abstract
    • Online 3D reconstruction has been an active research area for a long time. Since the release of the Microsoft Kinect Camera and publication of KinectFusion [11] attention has been drawn how to acquire dense models in real-time. In this paper we present a method to make online 3D reconstruction which increases robustness for scenes with little structure information and little texture information. It is shown empirically that our proposed method also increases robustness when the distance between the camera positions becomes larger than what is commonly assumed. Quantitative and qualitative results suggest that this approach can handle situations where other well-known methods fail. This is important in, for example, robotics applications like when the camera position and the 3D model must be created online in real-time.
  •  
17.
  • Bökman, Georg, 1994, et al. (författare)
  • A case for using rotation invariant features in state of the art feature matchers
  • 2022
  • Ingår i: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. - 2160-7516 .- 2160-7508. ; 2022-June, s. 5106-5115
  • Konferensbidrag (refereegranskat)abstract
    • The aim of this paper is to demonstrate that a state of the art feature matcher (LoFTR) can be made more robust to rotations by simply replacing the backbone CNN with a steerable CNN which is equivariant to translations and image rotations. It is experimentally shown that this boost is obtained without reducing performance on ordinary illumination and viewpoint matching sequences.
  •  
18.
  • Bökman, Georg, 1994, et al. (författare)
  • Investigating how ReLU-networks encode symmetries
  • 2023
  • Ingår i: Advances in Neural Information Processing Systems. - 1049-5258. ; 36
  • Konferensbidrag (refereegranskat)abstract
    • Many data symmetries can be described in terms of group equivariance and the most common way of encoding group equivariances in neural networks is by building linear layers that are group equivariant. In this work we investigate whether equivariance of a network implies that all layers are equivariant. On the theoretical side we find cases where equivariance implies layerwise equivariance, but also demonstrate that this is not the case generally. Nevertheless, we conjecture that CNNs that are trained to be equivariant will exhibit layerwise equivariance and explain how this conjecture is a weaker version of the recent permutation conjecture by Entezari et al. [2022]. We perform quantitative experiments with VGG-nets on CIFAR10 and qualitative experiments with ResNets on ImageNet to illustrate and support our theoretical findings. These experiments are not only of interest for understanding how group equivariance is encoded in ReLU-networks, but they also give a new perspective on Entezari et al.'s permutation conjecture as we find that it is typically easier to merge a network with a group-transformed version of itself than merging two different networks.
  •  
19.
  • Bökman, Georg, 1994, et al. (författare)
  • ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds
  • 2022
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - : IEEE Computer Society. - 1063-6919. ; 2022-June, s. 10966-10975
  • Konferensbidrag (refereegranskat)abstract
    • In this paper, we are concerned with rotation equivariance on 2D point cloud data. We describe a particular set of functions able to approximate any continuous rotation equivariant and permutation invariant function. Based on this result, we propose a novel neural network architecture for processing 2D point clouds and we prove its universality for approximating functions exhibiting these symmetries. We also show how to extend the architecture to accept a set of 2D-2D correspondences as indata, while maintaining similar equivariance properties. Experiments are presented on the estimation of essential matrices in stereo vision.
  •  
20.
  • Chelani, Kunal, 1992, et al. (författare)
  • How privacy-preserving are line clouds? Recovering scene details from 3D lines
  • 2021
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. ; , s. 15663-15673
  • Konferensbidrag (refereegranskat)abstract
    • Visual localization is the problem of estimating the camera pose of a given image with respect to a known scene. Visual localization algorithms are a fundamental building block in advanced computer vision applications, including Mixed and Virtual Reality systems. Many algorithms used in practice represent the scene through a Structure-from-Motion (SfM) point cloud and use 2D-3D matches between a query image and the 3D points for camera pose estimation. As recently shown, image details can be accurately recovered from SfM point clouds by translating renderings of the sparse point clouds to images. To address the resulting potential privacy risks for user-generated content, it was recently proposed to lift point clouds to line clouds by replacing 3D points by randomly oriented 3D lines passing through these points. The resulting representation is unintelligible to humans and effectively prevents point cloud-to-image translation. This paper shows that a significant amount of information about the 3D scene geometry is preserved in these line clouds, allowing us to (approximately) recover the 3D point positions and thus to (approximately) recover image content. Our approach is based on the observation that the closest points between lines can yield a good approximation to the original 3D points. Code is available at https://github.com/kunalchelani/Line2Point.
  •  
21.
  • Chelani, Kunal, 1992, et al. (författare)
  • Privacy-Preserving Representations are not Enough: Recovering Scene Content from Camera Poses
  • 2023
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. ; 2023-June, s. 13132-13141
  • Konferensbidrag (refereegranskat)abstract
    • Visual localization is the task of estimating the camera pose from which a given image was taken and is central to several 3D computer vision applications. With the rapid growth in the popularity of AR/VR/MR devices and cloudbased applications, privacy issues are becoming a very important aspect of the localization process. Existing work on privacy-preserving localization aims to defend against an attacker who has access to a cloud-based service. In this paper, we show that an attacker can learn about details of a scene without any access by simply querying a localization service. The attack is based on the observation that modern visual localization algorithms are robust to variations in appearance and geometry. While this is in general a desired property, it also leads to algorithms localizing objects that are similar enough to those present in a scene. An attacker can thus query a server with a large enough set of images of objects, e.g., obtained from the Internet, and some of them will be localized. The attacker can thus learn about object placements from the camera poses returned by the service (which is the minimal information returned by such a service). In this paper, we develop a proof-of-concept version of this attack and demonstrate its practical feasibility. The attack does not place any requirements on the localization algorithm used, and thus also applies to privacy-preserving representations. Current work on privacy-preserving representations alone is thus insufficient.
  •  
22.
  • Eriksson, Anders, et al. (författare)
  • Rotation Averaging with the Chordal Distance: Global Minimizers and Strong Duality
  • 2021
  • Ingår i: IEEE Transactions on Pattern Analysis and Machine Intelligence. - 1939-3539 .- 0162-8828. ; 43:1, s. 256-268
  • Tidskriftsartikel (refereegranskat)abstract
    • In this paper we explore the role of duality principles within the problem of rotation averaging, a fundamental task in a wide range of applications. In its conventional form, rotation averaging is stated as a minimization over multiple rotation constraints. As these constraints are non-convex, this problem is generally considered challenging to solve globally. We show how to circumvent this difficulty through the use of Lagrangian duality. While such an approach is well-known it is normally not guaranteed to provide a tight relaxation. Based on spectral graph theory, we analytically prove that in many cases there is no duality gap unless the noise levels are severe. This allows us to obtain certifiably global solutions to a class of important non-convex problems in polynomial time. We also propose an efficient, scalable algorithm that outperforms general purpose numerical solvers by a large margin and compares favourably to current state-of-the-art. Further, our approach is able to handle the large problem instances commonly occurring in structure from motion settings and it is trivially parallelizable. Experiments are presented for a number of different instances of both synthetic and real-world data.
  •  
23.
  • Fejne, Frida, 1986, et al. (författare)
  • Multiatlas Segmentation Using Robust Feature-Based Registration
  • 2017
  • Ingår i: , Cloud-Based Benchmarking of Medical Image Analysis. - Cham : Springer International Publishing. - 9783319496429 ; , s. 203-218
  • Bokkapitel (övrigt vetenskapligt/konstnärligt)abstract
    • This paper presents a pipeline which uses a multiatlas approach for multiorgan segmentation in whole-body CT images. In order to obtain accurate registrations between the target and the atlas images, we develop an adapted feature-based method which uses organ-specific features. These features are learnt during an offline preprocessing step, and thus, the algorithm still benefits from the speed of feature-based registration methods. These feature sets are then used to obtain pairwise non-rigid transformations using RANSAC followed by a thin-plate spline refinement or NiftyReg. The fusion of the transferred atlas labels is performed using a random forest classifier, and finally, the segmentation is obtained using graph cuts with a Potts model as interaction term. Our pipeline was evaluated on 20 organs in 10 whole-body CT images at the VISCERAL Anatomy Challenge, in conjunction with the International Symposium on Biomedical Imaging, Brooklyn, New York, in April 2015. It performed best on majority of the organs, with respect to the Dice index.
  •  
24.
  • Flehr, Alison, et al. (författare)
  • Development of a novel method to measure bone marrow fat fraction in older women using high-resolution peripheral quantitative computed tomography
  • 2022
  • Ingår i: Osteoporosis International. - : Springer Science and Business Media LLC. - 0937-941X .- 1433-2965. ; 33:7, s. 1545-1556
  • Tidskriftsartikel (refereegranskat)abstract
    • Bone marrow adipose tissue (BMAT) has been implicated in a number of conditions associated with bone deterioration and osteoporosis. Several studies have found an inverse relationship between BMAT and bone mineral density (BMD), and higher levels of BMAT in those with prevalent fracture. Magnetic resonance imaging (MRI) is the gold standard for measuring BMAT, but its use is limited by high costs and low availability. We hypothesized that BMAT could also be accurately quantified using high-resolution peripheral quantitative computed tomography (HR-pQCT). Methods: In the present study, a novel method to quantify the tibia bone marrow fat fraction, defined by MRI, using HR-pQCT was developed. In total, 38 postmenopausal women (mean [standard deviation] age 75.9 [3.1] years) were included and measured at the same site at the distal (n = 38) and ultradistal (n = 18) tibia using both MRI and HR-pQCT. To adjust for partial volume effects, the HR-pQCT images underwent 0 to 10 layers of voxel peeling to remove voxels adjacent to the bone. Linear regression equations were then tested for different degrees of voxel peeling, using the MRI-derived fat fractions as the dependent variable and the HR-pQCT-derived radiodensity as the independent variables. Results: The most optimal HR-pQCT derived model, which applied a minimum of 4 layers of peeled voxel and with more than 1% remaining marrow volume, was able to explain 76% of the variation in the ultradistal tibia bone marrow fat fraction, measured with MRI (p < 0.001). Conclusion: The novel HR-pQCT method, developed to estimate BMAT, was able to explain a substantial part of the variation in the bone marrow fat fraction and can be used in future studies investigating the role of BMAT in osteoporosis and fracture prediction.
  •  
25.
  • Fredriksson, Johan, et al. (författare)
  • Efficient algorithms for robust estimation of relative translation
  • 2016
  • Ingår i: Image and Vision Computing. - : Elsevier BV. - 0262-8856. ; 52, s. 114-124
  • Tidskriftsartikel (refereegranskat)abstract
    • One of the key challenges for structure from motion systems in order to make them robust to failure is the ability to handle outliers among the correspondences. In this paper we present two new algorithms that find the optimal solution in the presence of outliers when the camera undergoes a pure translation. The first algorithm has polynomial-time computational complexity, independently of the amount of outliers. The second algorithm does not offer such a theoretical complexity guarantee, but we demonstrate that it is magnitudes faster in practice. No random sampling approaches such as RANSAC are guaranteed to find an optimal solution, while our two methods do. We evaluate and compare the algorithms both on synthetic and real experiments. We also embed the algorithms in a larger system, where we optimize for the rotation angle as well (the rotation axis is measured by other means). The experiments show that for problems with a large amount of outliers, the RANSAC estimates may deteriorate compared to our optimal methods.
  •  
26.
  • Fredriksson, Johan, et al. (författare)
  • Fast and Reliable Two-View Translation Estimation
  • 2014
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781479951178 ; , s. 1606-1612
  • Konferensbidrag (refereegranskat)abstract
    • It has long been recognized that one of the fundamental difficulties in the estimation of two-view epipolar geometry is the capability of handling outliers. In this paper, we develop a fast and tractable algorithm that maximizes the number of inliers under the assumption of a purely translating camera. Compared to classical random sampling methods, our approach is guaranteed to compute the optimal solution of a cost function based on reprojection errors and it has better time complexity. The performance is in fact independent of the inlier/outlier ratio of the data. This opens up for a more reliable approach to robust ego-motion estimation. Our basic translation estimator can be embedded into a system that computes the full camera rotation. We demonstrate the applicability in several difficult settings with large amounts of outliers. It turns out to be particularly well-suited for small rotations and rotations around a known axis (which is the case for cellular phones where the gravitation axis can be measured). Experimental results show that compared to standard RANSAC methods based on minimal solvers, our algorithm produces more accurate estimates in the presence of large outlier ratios.
  •  
27.
  • Fredriksson, Johan, et al. (författare)
  • Optimal relative pose with unknown correspondences
  • 2016
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781467388511 ; 2016-January, s. 1728-1736
  • Konferensbidrag (refereegranskat)abstract
    • Previous work on estimating the epipolar geometry of two views relies on being able to reliably match feature points based on appearance. In this paper, we go one step further and show that it is feasible to compute both the epipolar geometry and the correspondences at the same time based on geometry only. We do this in a globally optimal manner. Our approach is based on an efficient branch and bound technique in combination with bipartite matching to solve the correspondence problem. We rely on several recent works to obtain good bounding functions to battle the combinatorial explosion of possible matchings. It is experimentally demonstrated that more difficult cases can be handled and that more inlier correspondences can be obtained by being less restrictive in the matching phase.
  •  
28.
  • Iglesias, José Pedro Lopes, 1994, et al. (författare)
  • Global Optimality for Point Set Registration Using Semidefinite Programming
  • 2020
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. ; 2020, s. 8284-8292, s. 8284-8292
  • Konferensbidrag (refereegranskat)abstract
    • In this paper we present a study of global optimality conditions for Point Set Registration (PSR) with missing data. PSR is the problem of aligning multiple point clouds with an unknown target point cloud. Since non-linear rotation constraints are present the problem is inherently non-convex and typically relaxed by computing the Lagrange dual, which is a Semidefinite Program (SDP). In this work we show that given a local minimizer the dual variables of the SDP can be computed in closed form. This opens up the possibility of verifying the optimally, using the SDP formulation without explicitly solving it. In addition it allows us to study under what conditions the relaxation is tight, through spectral analysis. We show that if the errors in the (unknown) optimal solution are bounded the SDP formulation will be able to recover it.
  •  
29.
  • Jafarzadeh, Ara, et al. (författare)
  • CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization
  • 2021
  • Ingår i: Proceedings of the IEEE International Conference on Computer Vision. - 1550-5499. ; , s. 9825-9835
  • Konferensbidrag (refereegranskat)abstract
    • Visual localization is the problem of estimating the position and orientation from which a given image (or a sequence of images) is taken in a known scene. It is an important part of a wide range of computer vision and robotics applications, from self-driving cars to augmented/virtual reality systems. Visual localization techniques should work reliably and robustly under a wide range of conditions, including seasonal, weather, illumination and man-made changes. Recent benchmarking efforts model this by providing images under different conditions, and the community has made rapid progress on these datasets since their inception. However, they are limited to a few geographical regions and often recorded with a single device. We propose a new benchmark for visual localization in outdoor scenes, using crowd-sourced data to cover a wide range of geographical regions and camera devices with a focus on the failure cases of current algorithms. Experiments with state-of-the-art localization approaches show that our dataset is very challenging, with all evaluated methods failing on its hardest parts. As part of the dataset release, we provide the tooling used to generate it, enabling efficient and effective 2D correspondence annotation to obtain reference poses.
  •  
30.
  • Jiang, Fangyuan, et al. (författare)
  • A Combinatorial Approach to L1-Matrix Factorization
  • 2015
  • Ingår i: Journal of Mathematical Imaging and Vision. - : Springer Science and Business Media LLC. - 1573-7683 .- 0924-9907. ; 51:3, s. 430-441
  • Tidskriftsartikel (refereegranskat)abstract
    • Recent work on low-rank matrix factorization has focused on the missing data problem and robustness to outliers and therefore the problem has often been studied under the $L_1$-norm. However, due to the non-convexity of the problem, most algorithms are sensitive to initialization and tend to get stuck in a local optimum. In this paper, we present a new theoretical framework aimed at achieving optimal solutions to the factorization problem. We define a set of stationary points to the problem that will normally contain the optimal solution. It may be too time-consuming to check all these points, but we demonstrate on several practical applications that even by just computing a random subset of these stationary points, one can achieve significantly better results than current state of the art. In fact, in our experimental results we empirically observe that our competitors rarely find the optimal solution and that our approach is less sensitive to the existence of multiple local minima.
  •  
31.
  • Jimenez-del-Toro, Oscar, et al. (författare)
  • Cloud-Based Evaluation of Anatomical Structure Segmentation and Landmark Detection Algorithms : VISCERAL Anatomy Benchmarks
  • 2016
  • Ingår i: IEEE Transactions on Medical Imaging. - : Institute of Electrical and Electronics Engineers (IEEE). - 0278-0062 .- 1558-254X. ; 35:11, s. 2459-2475
  • Tidskriftsartikel (refereegranskat)abstract
    • Variations in the shape and appearance of anatomical structures in medical images are often relevant radiological signs of disease. Automatic tools can help automate parts of this manual process. A cloud-based evaluation framework is presented in this paper including results of benchmarking current state-of-the-art medical imaging algorithms for anatomical structure segmentation and landmark detection: the VISCERAL Anatomy benchmarks. The algorithms are implemented in virtual machines in the cloud where participants can only access the training data and can be run privately by the benchmark administrators to objectively compare their performance in an unseen common test set. Overall, 120 computed tomography and magnetic resonance patient volumes were manually annotated to create a standard Gold Corpus containing a total of 1295 structures and 1760 landmarks. Ten participants contributed with automatic algorithms for the organ segmentation task, and three for the landmark localization task. Different algorithms obtained the best scores in the four available imaging modalities and for subsets of anatomical structures. The annotation framework, resulting data set, evaluation setup, results and performance analysis from the three VISCERAL Anatomy benchmarks are presented in this article. Both the VISCERAL data set and Silver Corpus generated with the fusion of the participant algorithms on a larger set of non-manually-annotated medical images are available to the research community.
  •  
32.
  • Kahl, Fredrik, 1972, et al. (författare)
  • Good Features for Reliable Registration in Multi-Atlas Segmentation
  • 2015
  • Ingår i: CEUR Workshop Proceedings. - 1613-0073. ; 1390:January, s. 12-17
  • Konferensbidrag (refereegranskat)abstract
    • This work presents a method for multi-organ segmentation in whole-body CT images based on a multi-atlas approach. A robust and efficient feature-based registration technique is developed which uses sparse organ specific features that are learnt based on their ability to register different organ types accurately. The best fitted feature points are used in RANSAC to estimate an affine transformation, followed by a thin plate spline refinement. This yields an accurate and reliable nonrigid transformation for each organ, which is independent of initialization and hence does not suffer from the local minima problem. Further, this is accomplished at a fraction of the time required by intensity-based methods. The technique is embedded into a standard multi-atlas framework using label transfer and fusion, followed by a random forest classifier which produces the data term for the final graph cut segmentation. For a majority of the classes our approach outperforms the competitors at the VISCERAL Anatomy Grand Challenge on segmentation at ISBI 2015.
  •  
33.
  • Koutouzi, Giasemi, 1984, et al. (författare)
  • Performance of a feature-based algorithm for 3D-3D registration of CT angiography to cone-beam CT for endovascular repair of complex abdominal aortic aneurysms
  • 2018
  • Ingår i: BMC Medical Imaging. - : Springer Science and Business Media LLC. - 1471-2342. ; 18:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: A crucial step in image fusion for intraoperative guidance during endovascular procedures is the registration of preoperative computed tomography angiography (CTA) with intraoperative Cone Beam CT (CBCT). Automatic tools for image registration facilitate the 3D image guidance workflow. However their performance is not always satisfactory. The aim of this study is to assess the accuracy of a new fully automatic, feature-based algorithm for 3D3D registration of CTA to CBCT. Methods: The feature-based algorithm was tested on clinical image datasets from 14 patients undergoing complex endovascular aortic repair. Deviations in Euclidian distances between vascular as well as bony landmarks were measured and compared to an intensity-based, normalized mutual information algorithm. Results: The results for the feature-based algorithm showed that the median 3D registration error between the anatomical landmarks of CBCT and CT images was less than 3mm. The feature-based algorithm showed significantly better accuracy compared to the intensity-based algorithm (p<0.001). Conclusion: A feature-based algorithm for 3D image registration is presented.
  •  
34.
  • Kuang, Yubin, et al. (författare)
  • Minimal solvers for relative pose with a single unknown radial distortion
  • 2014
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781479951178 ; , s. 33-40
  • Konferensbidrag (refereegranskat)abstract
    • In this paper, we study the problems of estimating relative pose between two cameras in the presence of radial distortion. Specifically, we consider minimal problems where one of the cameras has no or known radial distortion. There are three useful cases for this setup with a single unknown distortion: (i) fundamental matrix estimation where the two cameras are uncalibrated, (ii) essential matrix estimation for a partially calibrated camera pair, (iii) essential matrix estimation for one calibrated camera and one camera with unknown focal length. We study the parameterization of these three problems and derive fast polynomial solvers based on Gröbner basis methods. We demonstrate the numerical stability of the solvers on synthetic data. The minimal solvers have also been applied to real imagery with convincing results.
  •  
35.
  • Larsson, Måns, 1989, et al. (författare)
  • A cross-season correspondence dataset for robust semantic segmentation
  • 2019
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. ; 2019-June, s. 9524-9534
  • Konferensbidrag (refereegranskat)abstract
    • In this paper, we present a method to utilize 2D-2D point matches between images taken during different image conditions to train a convolutional neural network for semantic segmentation. Enforcing label consistency across the matches makes the final segmentation algorithm robust to seasonal changes. We describe how these 2D-2D matches can be generated with little human interaction by geometrically matching points from 3D models built from images. Two cross-season correspondence datasets are created providing 2D-2D matches across seasonal changes as well as from day to night. The datasets are made publicly available to facilitate further research. We show that adding the correspondences as extra supervision during training improves the segmentation performance of the convolutional neural network, making it more robust to seasonal changes and weather conditions.
  •  
36.
  • Larsson, Måns, 1989, et al. (författare)
  • A projected gradient descent method for crf inference allowing end-to-end training of arbitrary pairwise potentials
  • 2018
  • Ingår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. - 9783319781983 ; 10746 LNCS, s. 564-579
  • Konferensbidrag (refereegranskat)abstract
    • Are we using the right potential functions in the Conditional Random Field models that are popular in the Vision community? Semantic segmentation and other pixel-level labelling tasks have made significant progress recently due to the deep learning paradigm. However, most state-of-the-art structured prediction methods also include a random field model with a hand-crafted Gaussian potential to model spatial priors, label consistencies and feature-based image conditioning. In this paper, we challenge this view by developing a new inference and learning framework which can learn pairwise CRF potentials restricted only by their dependence on the image pixel values and the size of the support. Both standard spatial and high-dimensional bilateral kernels are considered. Our framework is based on the observation that CRF inference can be achieved via projected gradient descent and consequently, can easily be integrated in deep neural networks to allow for end-to-end training. It is empirically demonstrated that such learned potentials can improve segmentation accuracy and that certain label class interactions are indeed better modelled by a non-Gaussian potential. In addition, we compare our inference method to the commonly used mean-field algorithm. Our framework is evaluated on several public benchmarks for semantic segmentation with improved performance compared to previous state-of-the-art CNN+CRF models.
  •  
37.
  • Larsson, Måns, 1989, et al. (författare)
  • Deepseg: Abdominal Organ Segmentation Using Deep Convolutional Neural Networks
  • 2016
  • Ingår i: SSBA.
  • Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract
    • A fully automatic method for abdominal organ segmentation is presented. The method uses a robust initialization step based on a multi-atlas approach where the center of the organ is estimated together with a region of interest surrounding the center. As a second step a convolutional neural network performing pixelwise classification is applied. The convolutional neural network consists of several full 3D convolutional layers and takes two input features, which are designed to ensure both local and global consistency. Despite limited training data, our preliminary experimental results are on par with state-of-the-art approaches that have been developed over many years. More specifically the method is applied to the MICCAI2015 challenge “Multi-Atlas Labeling Beyond the Cranial Vault” in the free competition for organ segmentation in the abdomen. It achieved the best resultsfor 3 out of the 13 organs with a total mean dice coefficient of 0.757 for all organs. Top score was achieved for the gallbladder, the aorta and the right adrenal gland.
  •  
38.
  • Larsson, Måns, 1989, et al. (författare)
  • Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization
  • 2019
  • Ingår i: Proceedings of the IEEE International Conference on Computer Vision. - 1550-5499. ; 2019-October:October, s. 31-41
  • Konferensbidrag (refereegranskat)abstract
    • Long-term visual localization is the problem of estimating the camera pose of a given query image in a scene whose appearance changes over time. It is an important problem in practice, for example, encountered in autonomous driving. In order to gain robustness to such changes, long-term localization approaches often use segmantic segmentations as an invariant scene representation, as the semantic meaning of each scene part should not be affected by seasonal and other changes. However, these representations are typically not very discriminative due to the limited number of available classes. In this paper, we propose a new neural network, the Fine-Grained Segmentation Network (FGSN), that can be used to provide image segmentations with a larger number of labels and can be trained in a self-supervised fashion. In addition, we show how FGSNs can be trained to output consistent labels across seasonal changes. We demonstrate through extensive experiments that integrating the fine-grained segmentations produced by our FGSNs into existing localization algorithms leads to substantial improvements in localization performance.
  •  
39.
  • Larsson, Måns, 1989, et al. (författare)
  • Max-margin learning of deep structured models for semantic segmentation
  • 2017
  • Ingår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. - 9783319591285 ; 10270 LNCS, s. 28-40
  • Konferensbidrag (refereegranskat)abstract
    • During the last few years most work done on the task of image segmentation has been focused on deep learning and Convolutional Neural Networks (CNNs) in particular. CNNs are powerful for modeling complex connections between input and output data but lack the ability to directly model dependent output structures, for instance, enforcing properties such as smoothness and coherence. This drawback motivates the use of Conditional Random Fields (CRFs), widely applied as a post-processing step in semantic segmentation. In this paper, we propose a learning framework that jointly trains the parameters of a CNN paired with a CRF. For this, we develop theoretical tools making it possible to optimize a max-margin objective with back-propagation. The max-margin loss function gives the model good generalization capabilities. Thus, the method is especially suitable for applications where labelled data is limited, for example, medical applications. This generalization capability is reflected in our results where we are able to show good performance on two relatively small medical datasets. The method is also evaluated on a public benchmark (frequently used for semantic segmentation) yielding results competitive to state-of-the-art. Overall, we demonstrate that end-to-end max-margin training is preferred over piecewise training when combining a CNN with a CRF.
  •  
40.
  • Larsson, Måns, 1989, et al. (författare)
  • Revisiting Deep Structured Models for Pixel-Level Labeling with Gradient-Based Inference
  • 2018
  • Ingår i: SIAM Journal on Imaging Sciences. - 1936-4954. ; 11:4, s. 2610-2628
  • Tidskriftsartikel (refereegranskat)abstract
    • Semantic segmentation and other pixel-level labeling tasks have made significant progress recently due to the deep learning paradigm. Many state-of-the-art structured prediction methods also include a random field model with a hand-crafted Gaussian potential to model spatial priors and label consistencies and feature-based image conditioning. These random field models with image conditioning typically require computationally demanding filtering techniques during inference. In this paper, we present a new inference and learning framework which can learn arbitrary pairwise conditional random field (CRF) potentials. Both standard spatial and high-dimensional bilateral kernels are considered. In addition, we introduce a new type of potential function which is image-dependent like the bilateral kernel, but an order of magnitude faster to compute since only spatial convolutions are employed. It is empirically demonstrated that such learned potentials can improve segmentation accuracy and that certain label-class interactions are indeed better modeled by a non-Gaussian potential. Our framework is evaluated on several public benchmarks for semantic segmentation with improved performance compared to previous state-of-the-art CNN+CRF models.
  •  
41.
  • Larsson, Måns, 1989, et al. (författare)
  • Robust abdominal organ segmentation using regional convolutional neural networks
  • 2017
  • Ingår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. - 9783319591285 ; 10270 LNCS, s. 41-52
  • Konferensbidrag (refereegranskat)abstract
    • A fully automatic system for abdominal organ segmentation is presented. As a first step, an organ localization is obtained via a robust and efficient feature registration method where the center of the organ is estimated together with a region of interest surrounding the center. Then, a convolutional neural network performing voxelwise classification is applied. The convolutional neural network consists of several full 3D convolutional layers and takes both low and high resolution image data as input, which is designed to ensure both local and global consistency. Despite limited training data, our experimental results are on par with state-of-the-art approaches that have been developed over many years. More specifically the method is applied to the MICCAI2015 challenge “Multi-Atlas Labeling Beyond the Cranial Vault” in the free competition for organ segmentation in the abdomen. It achieved the best results for 3 out of the 13 organs with a total mean Dice coefficient of 0.757 for all organs. Top scores were achieved for the gallbladder, the aorta and the right adrenal gland.
  •  
42.
  • Larsson, Måns, 1989, et al. (författare)
  • Robust Abdominal Organ Segmentation Using Regional Convolutional Neural Networks
  • 2018
  • Ingår i: Applied Soft Computing Journal. - : Elsevier BV. - 1568-4946. ; 70, s. 465-470
  • Tidskriftsartikel (refereegranskat)abstract
    • A fully automatic system for abdominal organ segmentation is presented. As a first step, an organ localization is obtained via a robust and efficient feature registration method where the center of the organ is estimated together with a region of interest surrounding the center. Then, a convolutional neural network performing voxelwise classification is applied. Two convolutional neural networks of different architecture are compared. The first one has a structure similar to networks used for classification and is applied using a sliding window approach. The second one has a structure allowing it to be applied in a fully convolutional manner reducing computation time. Despite limited training data, our experimental results are on par with state-of-the-art approaches that have been developed over many years. More specifically the method is applied to the MICCAI2015 challenge “Multi-Atlas Labeling Beyond the Cranial Vault” in the free competition for organ segmentation in the abdomen. The method performed well for both types of convolutional neural networks. For the fully convolutional network a mean Dice coefficient of 0.767 was achieved, for the network applied with sliding window this number was 0.757.
  •  
43.
  • Larsson, Viktor, et al. (författare)
  • A Simple Method for Subspace Estimation with Corrupted Columns
  • 2016
  • Ingår i: Proceedings of the IEEE International Conference on Computer Vision. - 1550-5499. - 9781467383905 ; 2016-February, s. 841-849
  • Konferensbidrag (refereegranskat)abstract
    • This paper presents a simple and effective way of solving the robust subspace estimation problem where the corruptions are column-wise. The method we present can handle a large class of robust loss functions and is simple to implement. It is based on Iteratively Reweighted Least Squares (IRLS) and works in an iterative manner by solving a weighted least-squares rank-constrained problem in every iteration. By considering the special case of column-wise loss functions, we show that each such surrogate problem admits a closed form solution. Unlike many other approaches to subspace estimation, we make no relaxation of the low-rank constraint and our method is guaranteed to produce a subspace estimate with the correct dimension. Subspace estimation is a core problem for several applications in computer vision. We empirically demonstrate the performance of our method and compare it to several other techniques for subspace estimation. Experimental results are given for both synthetic and real image data including the following applications: linear shape basis estimation, plane fitting and non-rigid structure from motion.
  •  
44.
  • Larsson, Viktor, et al. (författare)
  • Outlier rejection for absolute pose estimation with known orientation
  • 2016
  • Ingår i: British Machine Vision Conference 2016, BMVC 2016. ; 2016-September, s. 45.1-45.12
  • Konferensbidrag (refereegranskat)abstract
    • Estimating the pose of a camera is a core problem in many geometric vision applications. While there has been much progress in the last two decades, the main difficulty is still dealing with data contaminated by outliers. For many scenes, e.g. with poor lightning conditions or repetitive textures, it is common that most of the correspondences are outliers. For real applications it is therefore essential to have robust estimation methods. In this paper we present an outlier rejection method for absolute pose estimation. We focus on the special case when the orientation of the camera is known. The problem is solved by projecting to a lower dimensional subspace where we are able to efficiently compute upper bounds on the maximum number of inliers. The method guarantees that only correspondences which cannot belong to an optimal pose are removed. In a number of challenging experiments we evaluate our method on both real and synthetic data and show improved performance compared to competing methods.
  •  
45.
  • Larsson, Viktor, et al. (författare)
  • Rank Minimization with Structured Data Patterns
  • 2014
  • Ingår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. - 9783319105772 ; 8691:PART 3, s. 250-265
  • Konferensbidrag (refereegranskat)abstract
    • The problem of finding a low rank approximation of a given measurement matrix is of key interest in computer vision. If all the elements of the measurement matrix are available, the problem can be solved using factorization. However, in the case of missing data no satisfactory solution exists. Recent approaches replace the rank term with the weaker (but convex) nuclear norm. In this paper we show that this heuristic works poorly on problems where the locations of the missing entries are highly correlated and structured which is a common situation in many applications. Our main contribution is the derivation of a much stronger convex relaxation that takes into account not only the rank function but also the data. We propose an algorithm which uses this relaxation to solve the rank approximation problem on matrices where the given measurements can be organized into overlapping blocks without missing data. The algorithm is computationally efficient and we have applied it to several classical problems including structure from motion and linear shape basis estimation. We demonstrate on both real and synthetic data that it outperforms state-of-the-art alternatives.
  •  
46.
  • Lindgren Belal, Sarah, et al. (författare)
  • 3D skeletal uptake of F-18 sodium fluoride in PET/CT images is associated with overall survival in patients with prostate cancer
  • 2017
  • Ingår i: EJNMMI Research. - : Springer Science and Business Media LLC. - 2191-219X. ; 7:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Sodium fluoride (NaF) positron emission tomography combined with computer tomography (PET/CT) has shown to be more sensitive than the whole-body bone scan in the detection of skeletal uptake due to metastases in prostate cancer. We aimed to calculate a 3D index for NaF PET/CT and investigate its correlation to the bone scan index (BSI) and overall survival (OS) in a group of patients with prostate cancer. Methods: NaF PET/CT and bone scans were studied in 48 patients with prostate cancer. Automated segmentation of the thoracic and lumbar spines, sacrum, pelvis, ribs, scapulae, clavicles, and sternum were made in the CT images. Hotspots in the PET images were selected using both a manual and an automated method. The volume of each hotspot localized in the skeleton in the corresponding CT image was calculated. Two PET/CT indices, based on manual (manual PET index) and automatic segmenting using a threshold of SUV 15 (automated PET15 index), were calculated by dividing the sum of all hotspot volumes with the volume of all segmented bones. BSI values were obtained using a software for automated calculations. Results: BSI, manual PET index, and automated PET15 index were all significantly associated with OS and concordance indices were 0.68, 0.69, and 0.70, respectively. The median BSI was 0.39 and patients with a BSI > 0.39 had a significantly shorter median survival time than patients with a BSI 0.53 had a significantly shorter median survival time than patients with a manual PET index 0.11 had a significantly shorter median survival time than patients with an automated PET15 index
  •  
47.
  • Nasihatkon, Seyed Behrooz, 1983, et al. (författare)
  • Globally optimal rigid intensity based registration: A fast fourier domain approach
  • 2016
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. - 9781467388511 ; 2016-January, s. 5936-5944
  • Konferensbidrag (refereegranskat)abstract
    • High computational cost is the main obstacle to adapting globally optimal branch-and-bound algorithms to intensity-based registration. Existing techniques to speed up such algorithms use a multiresolution pyramid of images and bounds on the target function among different resolutions for rigidly aligning two images. In this paper, we propose a dual algorithm in which the optimization is done in the Fourier domain, and multiple resolution levels are replaced by multiple frequency bands. The algorithm starts by computing the target function in lower frequency bands and keeps adding higher frequency bands until the current subregion is either rejected or divided into smaller areas in a branch and bound manner. Unlike spatial multiresolution approaches, to compute the target function for a wider frequency area, one just needs to compute the target in the residual bands. Therefore, if an area is to be discarded, it performs just enough computations required for the rejection. This property also enables us to use a rather large number of frequency bands compared to the limited number of resolution levels used in the space domain algorithm. Experimental results on real images demonstrate considerable speed gains over the space domain method in most cases.
  •  
48.
  • Nasihatkon, Seyed Behrooz, 1983, et al. (författare)
  • Multiresolution Search of the Rigid Motion Space for Intensity-Based Registration
  • 2018
  • Ingår i: IEEE Transactions on Pattern Analysis and Machine Intelligence. - 1939-3539 .- 0162-8828. ; 40:1, s. 179-191
  • Tidskriftsartikel (refereegranskat)abstract
    • We study the relation between the correlation-based target functions of low-resolution and high-resolution intensity-based registration for the class of rigid transformations. Our results show that low-resolution target values can tightly bound the high-resolution target function in natural images. This can help with analyzing and better understanding the process of multiresolution image registration. It also gives a guideline for designing multiresolution algorithms in which the search space in higher resolution registration is restricted given the fitness values for lower resolution image pairs. To demonstrate this, we incorporate our multiresolution technique into a Lipschitz global optimization framework. We show that using the multiresolution scheme can result in large gains in the efficiency of such algorithms. The method is evaluated by applying to the problems of 2D registration, 3D rotation search, and the detection of reflective symmetry in 2D and 3D images.
  •  
49.
  • Norlén, Alexander, 1988, et al. (författare)
  • Automatic pericardium segmentation and quantification of epicardial fat from computed tomography angiography
  • 2016
  • Ingår i: Journal of Mecial Imaging. - 2329-4302 .- 2329-4310. ; 3:3
  • Tidskriftsartikel (refereegranskat)abstract
    • Recent findings indicate a strong correlation between the risk of future heart disease and the volume ofadipose tissue inside of the pericardium. So far, large-scale studies have been hindered by the fact that manual delin-eation of the pericardium is extremely time-consuming and that existing methods for automatic delineation strugglewith accuracy. In this paper, an efficient and fully automatic approach to pericardium segmentation and epicardial fatvolume estimation is presented, based on a variant of multi-atlas segmentation for spatial initialization and a randomforest classifier for accurate pericardium detection. Experimental validation on a set of 30 manually delineated Com-puter Tomography Angiography (CTA) volumes shows a significant improvement on state-of-the-art in terms of EFVestimation (mean absolute epicardial fat volume difference: 3.8 ml (4.7%), Pearson correlation: 0.99) with run-timessuitable for large-scale studies (52 s). Further, the results compare favorably to inter-observer variability measured on10 volumes.
  •  
50.
  • Olsson, Carl, 1978, et al. (författare)
  • A quasiconvex formulation for radial cameras
  • 2021
  • Ingår i: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. - 1063-6919. ; , s. 14571-14580
  • Konferensbidrag (refereegranskat)abstract
    • In this paper we study structure from motion problems for 1D radial cameras. Under this model the projection of a 3D point is a line in the image plane going through the principal point, which makes the model invariant to radial distortion and changes in focal length. It can therefore effectively be applied to uncalibrated image collections without the need for explicit estimation of camera intrinsics. We show that the reprojection errors of 1D radial cameras are examples of quasiconvex functions. This opens up the possibility to solve a general class of relevant reconstruction problems globally optimally using tools from convex optimization. In fact, our resulting algorithm is based on solving a series of LP problems. We perform an extensive experimental evaluation, on both synthetic and real data, showing that a whole class of multiview geometry problems across a range of different cameras models with varying and unknown intrinsic calibration can be reliably and accurately solved within the same framework.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-50 av 63
Typ av publikation
konferensbidrag (41)
tidskriftsartikel (21)
bokkapitel (1)
Typ av innehåll
refereegranskat (59)
övrigt vetenskapligt/konstnärligt (4)
Författare/redaktör
Kahl, Fredrik, 1972 (63)
Enqvist, Olof, 1981 (12)
Larsson, Viktor (10)
Toft, Carl, 1990 (10)
Alvén, Jennifer, 198 ... (9)
Larsson, Måns, 1989 (9)
visa fler...
Hammarstrand, Lars, ... (8)
Olsson, Carl (7)
Fredriksson, Johan (7)
Sattler, Torsten, 19 ... (6)
Olsson, Carl, 1978 (6)
Pollefeys, Marc (5)
Fejne, Frida, 1986 (5)
Stenborg, Erik, 1980 (5)
Ulén, Johannes (4)
Sattler, Torsten (4)
Brynte, Lucas, 1990 (4)
Bökman, Georg, 1994 (4)
Bylow, Erik (4)
Trägårdh, Elin (3)
Edenbrandt, Lars, 19 ... (3)
Starck, Göran (3)
Landgren, Matilda (3)
Norlén, Alexander, 1 ... (3)
Kaboteh, Reza (3)
Arnab, Anurag (3)
Zheng, Shuai (3)
Sadik, May, 1970 (3)
Ohlsson, Mattias (2)
Arvidsson, Jonathan (2)
Maier, Stephan E, 19 ... (2)
Pajdla, Tomas (2)
Gu, Irene Yu-Hua, 19 ... (2)
Olsson, Claes (2)
Alipoor, Mohammad, 1 ... (2)
Mehnert, Andrew, 196 ... (2)
Båth, Magnus, 1974 (2)
Svensson, Lennart, 1 ... (2)
Torr, Philip H.S. (2)
Poulsen, Mads (2)
Høilund-Carlsen, Pou ... (2)
Iglesias, José Pedro ... (2)
Chelani, Kunal, 1992 (2)
Sivic, Josef (2)
Torii, Akihiko (2)
Chin, Tat-Jun (2)
Yuhang, Zhang, 1982 (2)
Polymeri, Eirini (2)
Simonsen, Jane Angel (2)
Hasani, N. (2)
visa färre...
Lärosäte
Chalmers tekniska högskola (62)
Lunds universitet (28)
Göteborgs universitet (8)
Umeå universitet (2)
Kungliga Tekniska Högskolan (1)
Språk
Engelska (63)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (54)
Teknik (49)
Medicin och hälsovetenskap (10)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy