↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Träfflista för sökning "WFRF:(Daneshtalab M.) "

Sökning: WFRF:(Daneshtalab M.)

Resultat 1-10 av 65

Sortera/gruppera träfflistan

Sortering: Träffar per sida:

Numrering	Referens	Omslagsbild	Hitta
1.	Taheri, M., et al. (författare) DeepAxe : A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators 2023 Ingår i: Proceedings - International Symposium on Quality Electronic Design, ISQED. - : IEEE Computer Society. - 9798350334753 Konferensbidrag (refereegranskat)abstract While the role of Deep Neural Networks (DNNs) in a wide range of safety-critical applications is expanding, emerging DNNs experience massive growth in terms of computation power. It raises the necessity of improving the reliability of DNN accelerators yet reducing the computational burden on the hardware platforms, i.e. reducing the energy consumption and execution time as well as increasing the efficiency of DNN accelerators. Therefore, the trade-off between hardware performance, i.e. area, power and delay, and the reliability of the DNN accelerator implementation becomes critical and requires tools for analysis.In this paper, we propose a framework DeepAxe for design space exploration for FPGA-based implementation of DNNs by considering the trilateral impact of applying functional approximation on accuracy, reliability and hardware performance. The framework enables selective approximation of reliability-critical DNNs, providing a set of Pareto-optimal DNN implementation design space points for the target resource utilization requirements. The design flow starts with a pre-trained network in Keras, uses an innovative high-level synthesis environment DeepHLS and results in a set of Pareto-optimal design space points as a guide for the designer. The framework is demonstrated on a case study of custom and state-of-the-art DNNs and datasets.
2.	Ahmadilivani, M. H., et al. (författare) A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks 2024 Ingår i: ACM Computing Surveys. - : ASSOC COMPUTING MACHINERY. - 0360-0300 .- 1557-7341. ; 56:6 Tidskriftsartikel (refereegranskat)abstract Artificial Intelligence (AI) and, in particular, Machine Learning (ML), have emerged to be utilized in various applications due to their capability to learn how to solve complex problems. Over the past decade, rapid advances in ML have presented Deep Neural Networks (DNNs) consisting of a large number of neurons and layers. DNN Hardware Accelerators (DHAs) are leveraged to deploy DNNs in the target applications. Safety-critical applications, where hardware faults/errors would result in catastrophic consequences, also benefit from DHAs. Therefore, the reliability of DNNs is an essential subject of research. In recent years, several studies have been published accordingly to assess the reliability of DNNs. In this regard, various reliability assessment methods have been proposed on a variety of platforms and applications. Hence, there is a need to summarize the state-of-the-art to identify the gaps in the study of the reliability of DNNs. In this work, we conduct a Systematic Literature Review (SLR) on the reliability assessment methods of DNNs to collect relevant research works as much as possible, present a categorization of them, and address the open challenges. Through this SLR, three kinds of methods for reliability assessment of DNNs are identified, including Fault Injection (FI), Analytical, and Hybrid methods. Since the majority of works assess the DNN reliability by FI, we characterize different approaches and platforms of the FI method comprehensively. Moreover, Analytical and Hybrid methods are propounded. Thus, different reliability assessment methods for DNNs have been elaborated on their conducted DNN platforms and reliability evaluation metrics. Finally, we highlight the advantages and disadvantages of the identified methods and address the open challenges in the research area. We have concluded that Analytical and Hybrid methods are light-weight yet sufficiently accurate and have the potential to be extended in future research and to be utilized in establishing novel DNN reliability assessment frameworks.
3.	Ahmadilivani, M. H., et al. (författare) Enhancing Fault Resilience of QNNs by Selective Neuron Splitting 2023 Ingår i: AICAS 2023 - IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceeding. - : Institute of Electrical and Electronics Engineers Inc.. - 9798350332674 Konferensbidrag (refereegranskat)abstract The superior performance of Deep Neural Networks (DNNs) has led to their application in various aspects of human life. Safety-critical applications are no exception and impose rigorous reliability requirements on DNNs. Quantized Neural Networks (QNNs) have emerged to tackle the complexity of DNN accelerators, however, they are more prone to reliability issues.In this paper, a recent analytical resilience assessment method is adapted for QNNs to identify critical neurons based on a Neuron Vulnerability Factor (NVF). Thereafter, a novel method for splitting the critical neurons is proposed that enables the design of a Lightweight Correction Unit (LCU) in the accelerator without redesigning its computational part.The method is validated by experiments on different QNNs and datasets. The results demonstrate that the proposed method for correcting the faults has a twice smaller overhead than a selective Triple Modular Redundancy (TMR) while achieving a similar level of fault resiliency.
4.	Ebrahimi, M., et al. (författare) An Efficient Dynamic Multicast Routing Protocol for Distributing Traffic in NOCs 2009 Ingår i: 2009 Design, Automation and Test in Europe Conference and Exhibition, DATE '09. ; , s. 1064-1069 Konferensbidrag (refereegranskat)abstract Nowadays, in MPSoCs and NoCs, multicast protocol is significantly used for many parallel applications such as cache coherency in distributed shared-memory architectures, clock synchronization, replication, or barrier synchronization. Among several multicast schemes proposed in on chip interconnection networks, path-based multicast scheme has been proven to be more efficient than the tree-based, and unicast-based. In this paper a low distance path-based multicast scheme is proposed. The proposed method takes advantage of the network partitioning, and utilizing of an efficient destination ordering algorithm. The results in performance, and power consumption show that the proposed method outstands the previous on chip path-based multicasting algorithms.
5.	Ebrahimi, M., et al. (författare) HARAQ : Congestion-Aware Learning Model for Highly Adaptive Routing Algorithm in On-Chip Networks 2012 Ingår i: Proceedings of the 2012 6th IEEE/ACM International Symposium on Networks-on-Chip, NoCS 2012. ; , s. 19-26 Konferensbidrag (refereegranskat)abstract The occurrence of congestion in on-chip networks can severely degrade the performance due to increased message latency. In mesh topology, minimal methods can propagate messages over two directions at each switch. When shortest paths are congested, sending more messages through them can deteriorate the congestion condition considerably. In this paper, we present an adaptive routing algorithm for on-chip networks that provide a wide range of alternative paths between each pair of source and destination switches. Initially, the algorithm determines all permitted turns in the network including 180-degree turns on a single channel without creating cycles. The implementation of the algorithm provides the best usage of all allowable turns to route messages more adaptively in the network. On top of that, for selecting a less congested path, an optimized and scalable learning method is utilized. The learning method is based on local and global congestion information and can estimate the latency from each output channel to the destination region.
6.	Fallah, M. K., et al. (författare) Scalable parallel genetic algorithm for solving large integer linear programming models derived from behavioral synthesis 2020 Ingår i: Proceedings - 2020 28th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2020. - : Institute of Electrical and Electronics Engineers Inc.. - 9781728165820 ; , s. 390-394 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract Solving Integer Linear Programming (ILP) models generally lies in the category of NP-hard problems. Therefore, as the size of ILP models grows, the efficiency of exact algorithms for solving the models reduced significantly and for large models it is not possible to have the result. Genetic Algorithm (GA) is a metaheuristic method capable of adjusting and redesigning parameters and operations according to the characteristics of ILP models. Still GA has huge search space for large models and parallelization is a suitable technique to tackle this problem. This paper presents a scalable parallel GA to solve large ILP models derived from behavioral synthesis of digital circuits. We show that although models have non-binary variables, only binary variables are sufficient for coding chromosomes. We also use 'unknown' values for some genes to decrease the likelihood of inconsistency in the encoded constraints. Our experiments verify the efficiency and scalability of the proposed algorithm on multicore platforms. The proposed method outperforms IBM ILOG CPLEX 12.6 and MI-LXPM algorithm where the ILP models include 550 to 2258 int / binary decision variables. Also, the results indicate that the saturation point of using parallel processing elements for solving the large ILP models is at least 60.
7.	Daneshtalab, M., et al. (författare) A Low-Latency and Memory-Efficient On-chip Network 2010 Ingår i: NOCS 2010. ; , s. 99-106 Konferensbidrag (refereegranskat)abstract Using multiple SDRAMs in MPSoCs and NoCs to increase memory parallelism is very common nowadays. In-order delivery, resource utilization, and latency are the most critical issues in such architectures. In this paper, we present a novel network interface architecture to cope with these issues efficiently. The proposed network interface exploits a resourceful reordering mechanism to handle the in-order delivery and to increase the resource utilization. A brilliant memory controller is efficiently integrated into this network interface to improve the memory utilization and reduce both memory and network latencies. In addition, to bring compatibility with existing IP cores the proposed network interface utilizes AXI transaction based protocol. Experimental results with synthetic test cases demonstrate that the proposed architecture gives significant improvements in average network latency (12%), average memory access latency (19%), and average memory utilization (22%).
8.	Daneshtalab, M., et al. (författare) CMIT : A novel cluster-based topology for 3D stacked architectures 2010 Ingår i: IEEE 3D System Integration Conference 2010, 3DIC 2010. Konferensbidrag (refereegranskat)abstract Combining the benefits of 3D IC and Network-on-Chip (NoC) schemes, provides a significant performance gain for 3D stacked architectures. In recent years, Through-Silicon-Via (TSV), employed for inter-layer connectivity (vertical channel), has attracted a lot of interest since it enables faster and more power efficient inter-layer communication across multiple stacked layers. However, the area overhead of TSVs reduces wafer utilization and yield which impact design of 3D architectures using a large number of TSVs. In this paper, we propose a novel stacked topology, named CMIT (Cluster Mesh Inter-layer Topology) for 3D architectures to reduce the area overhead of TSVs and power dissipation on each layer with minimal performance penalty. Experimental results with synthetic test cases demonstrate that the presented topology can save more than 75% of TSV area footprint and reduces more than 10% of power consumption with a negligible performance overhead.
9.	Daneshtalab, M., et al. (författare) High-performance on-chip network platform for memory-on-processor architectures 2011 Ingår i: 6th International Workshop on Reconfigurable Communication-Centric Systems-on-Chip, ReCoSoC 2011 - Proceedings. Konferensbidrag (refereegranskat)abstract Three Dimensional Integrated Circuits (3D ICs) are emerging to improve existing Two Dimensional (2D) designs by providing smaller chip areas, higher performance and lower power consumption. Stacking memory layers on top of a multiprocessor layer (logic layer) is a potential solution to reduce wire delay and increase the bandwidth. To fully employ this capability, an efficient on-chip communication platform is required to be integrated in the logic layer. In this paper, we present an on-chip network platform for the logic layer utilizing an efficient network interface to exploit the potential bandwidth of stacked memory-on-processor architectures. Experimental results demonstrate that the platform equipped with the presented network interface increases the performance considerably.
10.	Daneshtalab, M., et al. (författare) High-Performance TSV Architecture for 3-D ICs 2010 Ingår i: Proceedings - IEEE Annual Symposium on VLSI, ISVLSI 2010. - : Institute of Electrical and Electronics Engineers (IEEE). - 9781424473212 ; , s. 467-468 Konferensbidrag (refereegranskat)abstract Three-dimensional integrated circuits (3-D ICs) outperform traditional planar ICs in terms of performance, packaging density, interconnection power consumption, and functionality. Since the performance of 3-D ICs employing Through Silicon Vias (TSVs) depends on vertical interlayer interconnects, in this paper we present a high-performance bus architecture for TSVs.

Skapa referenser, mejla, bekava och länka

Länka till träfflistan

Resultat 1-10 av 65

Avgränsa träffmängd

Typ av publikation: konferensbidrag (51); tidskriftsartikel (11); bokkapitel (2); samlingsverk (redaktörskap) (1)

Typ av innehåll: refereegranskat (61); övrigt vetenskapligt/konstnärligt (4)

Författare/redaktör: Daneshtalab, Masoud (45); Daneshtalab, M. (20); Ebrahimi, M (19); Tenhunen, Hannu (19); Liljeberg, P. (16); Plosila, J. (14); visa fler...; Modarressi, M. (10); Raik, J. (7); Palesi, M. (6); Wang, X. (5); Hemani, Ahmed (5); Taheri, M. (5); Jenihhin, M. (5); Sinaei, Sima (5); Ahmadilivani, M. H. (4); Loni, Mohammad (4); Jafri, Syed M. A. H. (4); Sjödin, Mikael, 1971 ... (3); Mohammadi, S. (3); Salehi, M. E. (3); Abdollahi, M (2); Ashjaei, Seyed Moham ... (2); Mubeen, Saad (2); Rezaei, A (2); Lisper, Björn (2); Nazari, N (2); Mousavi, Hamid (2); Kargahi, M. (2); Plosila, Juha (2); Sjodin, M. (2); Loni, Mohammad, PhD ... (2); Troubitsyna, Elena (2); Safari, S (2); Nolin, Mikael, 1971- (2); Ellervee, Peeter (2); Namazi, A (2); Sjödin, M (2); Paul, Kolin (2); Firuzan, A. (2); Hojabr, R. (2); Houtan, Bahar, 1989- (2); Aybek, M. O. (2); Tajammul, Adeel (2); Kokhazadeh, M. (2); Kokhazad, Z. (2); Dehyadegari, M. (2); Majd, A. (2); Riazati, Mohammad (2); Yasoubi, A. (2); Patti, D. (2); visa färre...

Lärosäte: Kungliga Tekniska Högskolan (38); Mälardalens universitet (32); RISE (5); Linköpings universitet (1)

Språk: Engelska (65)

Forskningsämne (UKÄ/SCB): Teknik (38); Naturvetenskap (21)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

Copyright © LIBRIS - Nationella bibliotekssystem
LIBRIS.kb.se

pil uppåt

Stäng

Kopiera och spara länken för att återkomma till aktuell vy