SwePub
Tyck till om SwePub Sök här!
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Öberg Johnny) "

Sökning: WFRF:(Öberg Johnny)

  • Resultat 51-60 av 131
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
51.
  • Mand, Nowshad Painda, et al. (författare)
  • Artificial neural network emulation on NOC based multi-core FPGA platform
  • 2012
  • Ingår i: NORCHIP, 2012. - : IEEE. - 9781467322218 ; , s. 6403122-
  • Konferensbidrag (refereegranskat)abstract
    • With the emergence of Multi-Core platforms, brain emulation in the form of Artificial Neural Nets has been announced as one of the important key research area. However, due to large non-linear growth of inter-neuron connectivity, direct mapping of ANNs to silicon structures is very difficult due to communication bottleneck.
  •  
52.
  • Mand, N. P., et al. (författare)
  • Going for brain-scale integration - Using FPGAS, TSVs and NOC based artificial neural networks : A case study
  • 2014
  • Ingår i: 11th FPGAworld Conference - Academic Proceedings 2014, FPGAWorld 2014. - New York, NY, USA : ACM Digital Library. - 9781450331302
  • Konferensbidrag (refereegranskat)abstract
    • With better understanding of brain's massive parallel processing, brain-scale integration has been announced as one of the key research area in modern times and numerous efforts has been done to mimic such models. Multicore architectures, Network-On-Chip, 3D stacked ICs with TSVs, FPGA's growth beyond Moore's law and new design methodologies like high level synthesis will ultimately lead us toward single- and multi-chip solutions of Artificial Neural Net models comprising of millions or even more neurons per chip. Historically ANNs have been emulated as either software models, ASICs or a hybrid of both. Software models are very slow while ASICs based designs lacks plasticity. FPGA consumes a little more power but offer the flexibility of software and performance of ASICs along with basic requirement of plasticity in the form of reconfigurability. However, the traditional bottom up approach for building large ANN models is no more feasible and wiring along with memory becomes major bottlenecks when considering networks comprised of large number of neurons. The aim of this paper is to present a design space exploration of large-scale ANN models using a scalable NOC based architecture together with high level synthesis tools to explore the feasibility of implementing brain-scale ANNs on FPGAs using 3D stacked memory structures.
  •  
53.
  •  
54.
  • Meincke, Thomas, et al. (författare)
  • Globally asynchronous locally synchronous architecture for large high-performance ASICs
  • 1999
  • Ingår i: ; 2, s. 512-515
  • Konferensbidrag (refereegranskat)abstract
    • Clock nets are the major source of power consumption in large, high-performance ASICs and a design bottleneck when it comes to tolerable clock skew. A way to obviate the global clock net is to partition the design into large synchronous blocks each having its own clock. Data with other blocks is exchanged asynchronously using handshake signals. Adopting such a strategy requires a methodology that supports: 1) a partitioning method dividing a design into the number of synchronous blocks such that the gain due to global clock net removal exceeds the communication overhead and 2) synthesis of handshake protocols to implement the data transfer between synchronous blocks. We describe this methodology and present results of applying it to a realistic design done in 0.25 micron, ranging in operating frequencies from 20 MHz to 1 GHz. The results show that the net power savings compared to fully synchronous designs are on an average about 30%
  •  
55.
  •  
56.
  • Minhass, Wajid Hassan, et al. (författare)
  • Design and implementation of a plesiochronous multi-core 4x4 network-on-chip FPGA platform with MPI HAL support
  • 2009
  • Ingår i: 6th FPGAworld Conference, Academic Proceedings 2009. - New York, NY, USA : ACM. - 9781605588797 ; , s. 52-57
  • Konferensbidrag (refereegranskat)abstract
    • The Multi-Core NoC is a 4 by 4 Mesh NoC targeted for Altera FPGAs. It implements a deflective routing policy and is used to connect sixteen NIOS II processors. Each NIOS II is connected to the NoC via an address-mapped Resource Network Interface. The Multi-Core NoC is implemented on four separate Altera Stratix II FPGA boards, each hosting a Quad-Core NoC, which operates on a local 50 MHz clock. It has an onboard throughput of 650 Mbps (12.5 MFlit/s), and uses 28% of the LUs, 18% of the ALUTs, 22 % of the dedicated registers and 31% of the total memory blocks of a Stratix II FPGA. Asynchronous clock bridges, with a throughput of 50 Mbps (∼1MFlit/s), are used for the inter-board communication. Application programs use an MPI compatible Hardware Abstraction Layer (HAL) to communicate with the Resource Network Interface of the NoC. The RNI sets up message transfer, with a maximum length of 512 bytes, and sends flits with the size of 32 bit data plus 20 bit headers through the network. The MPI is the bottleneck of the system; it takes 46 us (43.4 kPackets/s) to send a minimum-sized packet through the protocol stack to a near neighbour and bounce it back to the original application. The bounce-back time for a far neighbour is 56 us.
  •  
57.
  • Minhass, Wajid Hassan, et al. (författare)
  • Implementation of a scalable, globally plesiochronous locally synchronous, off-chip NoC communication protocol
  • 2009
  • Ingår i: 2009 NORCHIP. - 9781424443109 ; , s. 1-5
  • Konferensbidrag (refereegranskat)abstract
    • Multiprocessor system-on-chip design (MPSoC) is becoming a regular feature of the embedded systems. Shared-bus systems hold many advantages, but they do not scale. Network on chip (NoC) offers a promising solution to the scalability problem by enhancing the topology design. However, standard NoCs are only scalable within a chip. To be able to build infinitely scalable structures, a method to enhance the NoC-grid off-chip is needed. In this paper, we present such a method. As a proof of concept, the protocol is implemented on a 4 by 4 Mesh NoC, with NIOS II CPU cores as nodes, partitioned across four separate Altera FPGA boards, each board hosting a Quad-Core (2x2) NoC, operating on a local 50 MHz clock. The inter-chip communication protocol uses asynchronous clock bridges, with a throughput of 50 Mbps (~1MFlit/s) and is completely scalable. The NoC has an onboard throughput of 650 Mbps (12.5 MFlit/s). Each Quad-Core uses 28% of the LUs, 18% of the ALUTs, 22 % of the dedicated registers and 31% of the total memory blocks of the Stratix II FPGAs. Application programs use an MPI compatible Hardware Abstraction Layer (HAL) to communicate with each other over the NoC.
  •  
58.
  •  
59.
  • Navas, Byron, et al. (författare)
  • Camera and LCM IP-Cores for NIOS SOPC System
  • 2009
  • Ingår i: 6th FPGAworld Conference, Academic Proceedings 2009. - New York : ACM. - 9781605588797 ; , s. 18-23
  • Konferensbidrag (refereegranskat)abstract
    • This paper presents the development of IP-Cores to integrate the Terasic DC2 Camera and LCM (LCD Module) daughter boards into an Altera Nios System, so that the image can be further processed by embedded software or custom hardware instructions. Among other challenges overcome during this work are clock-domain crossing, synchronizing FIFO design, variable and pipelined burst control, multi-masters contention for system memory and image frame buffer switching. In addition, we designed software device drivers, and API functions intended for graphics, image processing and video control; which are part of the IP deliverables. In a brief, this work describes some concepts and methodologies involved in the creation of IP-Cores for an Altera SOPC; it also presents the results of the designed CAM-IP and LCM-IP Cores working in an application demo, which constitutes a real solution and a reference design.
  •  
60.
  • Navas, Byron, 1969- (författare)
  • Cognitive and Self-Adaptive SoCs with Self-Healing Run-Time-Reconfigurable RecoBlocks
  • 2015
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • In contrast to classical Field-Programmable Gate Arrays (FPGAs), partial and run-time reconfigurable (RTR) FPGAs can selectively reconfigure partitions of its hardware almost immediately while it is still powered and operative. In this way, RTR FPGAs combine the flexibility of software with the high efficiency of hardware. However, their potential cannot be fully exploited due to the increased complexity of the design process, and the intricacy to generate partial reconfigurations. FPGAs are often seen as a single auxiliary area to accelerate algorithms for specific problems. However, when several RTR partitions are implemented and combined with a processor system, new opportunities and challenges appear due to the creation of a heterogeneous RTR embedded system-on-chip (SoC).The aim of this thesis is to investigate how the flexibility, reusability, and productivity in the design process of partial and RTR embedded SoCs can be improved to enable research and development of novel applications in areas such as hardware acceleration, dynamic fault-tolerance, self-healing, self-awareness, and self-adaptation. To address this question, this thesis proposes a solution based on modular reconfigurable IP-cores and design-and-reuse principles to reduce the design complexity and maximize the productivity of such FPGA-based SoCs. The research presented in this thesis found inspiration in several related topics and sciences such as reconfigurable computing, dependability and fault-tolerance, complex adaptive systems, bio-inspired hardware, organic and autonomic computing, psychology, and machine learning.The outcome of this thesis demonstrates that the proposed solution addressed the research question and enabled investigation in initially unexpected fields. The particular contributions of this thesis are: (1) the RecoBlock SoC concept and platform with its flexible and reusable array of RTR IP-cores, (2) a simplified method to transform complex algorithms modeled in Matlab into relocatable partial reconfigurations adapted to an improved RecoBlock IP-core architecture, (3) the self-healing RTR fault-tolerant (FT) schemes, especially the Upset-Fault-Observer (UFO) that reuse available RTR IP-cores to self-assemble hardware redundancy during runtime, (4) the concept of Cognitive Reconfigurable Hardware (CRH) that defines a development path to achieve self-adaptation and cognitive development, (5) an adaptive self-aware and fault-tolerant RTR SoC that learns to adapt the RTR FT schemes to performance goals under uncertainty using rule-based decision making, (6) a method based on online and model-free reinforcement learning that uses a Q-algorithm to self-optimize the activation of dynamic FT schemes in performance-aware RecoBlock SoCs.The vision of this thesis proposes a new class of self-adaptive and cognitive hardware systems consisting of arrays of modular RTR IP-cores. Such a system becomes self-aware of its internal performance and learns to self-optimize the decisions that trigger the adequate self-organization of these RTR cores, i.e., to create dynamic hardware redundancy and self-healing, particularly while working in uncertain environments.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 51-60 av 131
Typ av publikation
konferensbidrag (99)
rapport (14)
tidskriftsartikel (8)
doktorsavhandling (5)
licentiatavhandling (3)
bokkapitel (2)
visa fler...
visa färre...
Typ av innehåll
refereegranskat (106)
övrigt vetenskapligt/konstnärligt (24)
populärvet., debatt m.m. (1)
Författare/redaktör
Öberg, Johnny (129)
Hemani, Ahmed (41)
Jantsch, Axel (33)
Tenhunen, Hannu (17)
Kumar, Shashi (8)
O'Nils, Mattias (4)
visa fler...
Olsson, Thomas (3)
Nilsson, Peter (3)
Peng, Zebo (3)
Eles, Petru (3)
Wosinska, Lena (2)
Thylén, Lars (2)
Dubrova, Elena (2)
Zhang, J. (1)
Schreiner, S. (1)
Graham, D. (1)
Maleki, A (1)
Pereira, D. (1)
Törngren, Martin, 19 ... (1)
Perez, Jon (1)
Asplund, Fredrik, 19 ... (1)
Crespo, A (1)
Olsson, Johanna (1)
Magnusson, Mats, Pro ... (1)
Sangchoolie, Behrooz (1)
Agirre, J. A. (1)
Etxeberria, L. (1)
Barbosa, R. (1)
Basagiannis, S. (1)
Giantamidis, G. (1)
Bauer, T. (1)
Ferrari, E. (1)
Labayen Esnaola, M. (1)
Orani, V. (1)
Proença, J. (1)
Schlick, R. (1)
Smrčka, A. (1)
Tiberti, W. (1)
Tonetta, S. (1)
Bozzano, M. (1)
Yazici, A. (1)
Zheng, Li-Rong (1)
Tibert, Gunnar (1)
Cartmell, Matthew (1)
Olsson, Henrik (1)
Lu, Zhonghai (1)
Hessmo, Björn (1)
Herzog, Erik (1)
Kuchcinski, Krzyszto ... (1)
Plosila, Juha (1)
visa färre...
Lärosäte
Kungliga Tekniska Högskolan (128)
Mittuniversitetet (3)
Lunds universitet (2)
Mälardalens universitet (1)
RISE (1)
Språk
Engelska (131)
Forskningsämne (UKÄ/SCB)
Teknik (123)
Naturvetenskap (2)
Samhällsvetenskap (1)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy