SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Öberg Johnny) ;pers:(Nilsson Erland)"

Sökning: WFRF:(Öberg Johnny) > Nilsson Erland

  • Resultat 1-6 av 6
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Nilsson, Erland, 1977- (författare)
  • Exploring trade-offs between Latency and Throughput in the Nostrum Network on Chip
  • 2006
  • Licentiatavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • During the past years has the Nostrum Network on Chip (NoC) been developed to become a competitive platform for network based on-chip communication. The Nostrum NoC provides a versatile communication platform to connect a large number of intellectual properties (IP) on a single chip. The communication is based on a packet switched network which aims for a small physical footprint while still providing a low communication overhead. To reduce the communication network size, a queue-less network has been the research focus. This network uses de ective hot-potato routing which is implemented to perform routing decisions in a single clock cycle. Using a platform like this results in increased design reusability, validated signal integrity, and well developed test strategies, in contrast to a fully customised designs which can have a more optimal communication structure but has a significantly longer development cycle to verify the new design accordingly. Several factors are considered when designing a communication platform. The goal is to create a platform which provides low communication latency, high throughput, low power consumption, small footprint, and low design, verification, and test overhead. Proximity Congestion Awareness is one technique that serves to reduce the network load. This leads to that the latency is reduced which also increases the network throughput. Another technique is to implement low latency paths called Data Motorways achieved through a clocking method called Globally Pseudochronous Locally Synchronous clocking. Furthermore, virtual circuits can be used to provide guarantees on latency and throughput. Such guarantees are dificult in hot-potato networks since network access has to be ensured. A technique that implements virtual circuits use looped containers that are circulating on a predefined circuit. Several overlapping virtual circuits are possible by allocating the virtual circuits in different Temporally Disjoint Networks. This thesis summarise the impact the presented techniques and methods have on the characteristics on the Nostrum model. It is possible to reduce the network load by a factor of 20 which reduces the communication latency. This is done by distributing load information between the Switches in the network. Data Motorways can reduce the communication latency with up to 50% in heavily loaded networks. Such latency reduction results in freed buffer space in the Switch registers which allows the traffic rate to be increased with about 30%.
  •  
2.
  •  
3.
  • Nilsson, Erland, et al. (författare)
  • Load Distribution with the Proximity Congestion Awareness in a Network on Chip
  • 2003
  • Ingår i: Design, Automation And Test In Europe Conference And Exhibition, Proceedings. - LOS ALAMITOS, USA : IEEE COMPUTER SOC. - 0769518702 ; , s. 1126-1127
  • Konferensbidrag (refereegranskat)abstract
    • In Networks on Chip, NoC, very low cost and high performance switches will be of critical importance. For a regular two-dimensional NoC we propose a very simple, memoryless switch. In case of congestion, packets are emitted in a non-ideal direction, also called deflective routing. To increase the maximum tolerable load of the network, we propose a Proximity Congestion Awareness, PCA, technique, where switches use load information of neighbouring switches, called stress values, for their own switching decisions, thus avoiding congested areas. We present simulation results with random traffic which show that the PCA technique can increase the maximum traffic load by a factor of over 20.
  •  
4.
  •  
5.
  • Nilsson, Erland, et al. (författare)
  • Reducing Power and Latency in 2-D Mesh NoCs using Globally Pseudochronous Locally Synchronous Clocking
  • 2004
  • Ingår i: International Conference On Hardware/Software Codesign And System Synthesis. - New York, USA : ASSOC COMPUTING MACHINERY. - 1581139373 ; , s. 176-181
  • Konferensbidrag (refereegranskat)abstract
    • One of the main problems when designing large ASICs today is to distribute a low power synchronous clock over the whole chip and a lot of remedies to this problem has been proposed over the years. For Networks-on-Chip (NoC), where computational Resources are organised in a 2-D mesh connected together through Switches in an on-chip interconnection network, another possibility exists: Globally Pseudochronous Locally Synchronous clock distribution. In this paper, we present a clocking scheme for NoCs that we call Globally Pseudochronous Locally Synchronous, in which we distribute a clock with a constant phase difference between he switches. As a consequence of the phase difference, some paths along the NoC switch network become faster than the others. We call these paths Data Motorways. By adapting the switching policy in the switches to prefer data to use the motorways, we show that the latency within the network is reduced with up to 40% compared to a synchronous reference case. The phase difference between the resources also makes the circuit more tolerant to clock skew. It also distributes the current peaks more evenly across the clock period, which lead to a reduction in peak power, which in turn further reduces the clock skew and the jitter in the clock network.
  •  
6.
  • Nilsson, Erland, et al. (författare)
  • Trading off Power versus Latency using GPLS Clocking in 2D-Mesh NoCs
  • 2005
  • Ingår i: Isscs 2005: International Symposium On Signals, Circuits And Systems, Vols 1 And 2, Proceedings. - New York, USA : IEEE. - 0780390296 ; , s. 51-54
  • Konferensbidrag (refereegranskat)abstract
    • To handle the design complexity when the number of transistors on-chip reaches one billion, new ways of organizing chips will be needed. One solution to this problem is to organize computational resources in a grid, where all communication between the resources are performed using an interconnection network. These networks are commonly referred to as Networks-on-Chip, or NoCs. This paper focus on the trade-off between power and latency while keeping the required interconnection bandwidth constant. The clock frequency can be lowered to reduce the power, with reduced bandwidth as a consequence, which in a synchronous system will increase the latency linearly. In a 2D-Mesh NoC structure, it is possible to choose the regions with different clock phase and arrange them in such ways that the latency from sender to receiver along certain paths is nearly constant, and the total average latency is reduced with 50%. The reduction can then be exploited to trade off latency vs. power; the GPLS solution consumes 50% or the power compared to the fully synchronous solution, at the same latency and constant throughput.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-6 av 6
Typ av publikation
konferensbidrag (4)
rapport (1)
licentiatavhandling (1)
Typ av innehåll
refereegranskat (4)
övrigt vetenskapligt/konstnärligt (2)
Författare/redaktör
Öberg, Johnny (6)
Jantsch, Axel (2)
Millberg, Mikael (2)
Nilsson, Erland, 197 ... (1)
Wiklund, Daniel, PhD (1)
Lärosäte
Kungliga Tekniska Högskolan (6)
Språk
Engelska (6)
Forskningsämne (UKÄ/SCB)
Teknik (6)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy