|
|
|
Sardar Anisul Haque, Mohammad Tanvir Parvez and Shahadat Hossain
Matrix?matrix multiplication is of singular importance in linear algebra operations with a multitude of applications in scientific and engineering computing. Data structures for storing matrix elements are designed to minimize overhead information as wel...
ver más
|
|
|
|
|
|
|
Mikhail Babenko, Elena Golimblevskaia, Andrei Tchernykh, Egor Shiriaev, Tatiana Ermakova, Luis Bernardo Pulido-Gaytan, Georgii Valuev, Arutyun Avetisyan and Lana A. Gagloeva
Homomorphic encryption (HE) is a promising solution for handling sensitive data in semi-trusted third-party computing environments, as it enables processing of encrypted data. However, applying sophisticated techniques such as machine learning, statistic...
ver más
|
|
|
|
|
|
|
Yang Wang, Jie Liu, Xiaoxiong Zhu, Qingyang Zhang, Shengguo Li and Qinglin Wang
Structured grid-based sparse matrix-vector multiplication and Gauss?Seidel iterations are very important kernel functions in scientific and engineering computations, both of which are memory intensive and bandwidth-limited. GPDSP is a general purpose dig...
ver más
|
|
|
|
|
|
|
Jiyang Yu, Dan Huang, Wenjie Li, Xianjie Wang and Xiaolong Shi
The method studied in this paper is applied to the control calculation of manipulator, especially the matrix chain multiplication in the calculation of forward and inverse kinematics solutions of manipulator.
|
|
|
|
|
|
|
Tamas Foldi, Chris von Csefalvay and Nicolas A. Perez
The new barrier mode in Apache Spark allows for embedding distributed deep learning training as a Spark stage to simplify the distributed training workflow. In Spark, a task in a stage does not depend on any other tasks in the same stage, and hence it ca...
ver más
|
|
|
|
|
|
|
?.?. Cherepniov
Pág. 9 - 13
In recent years, increasingly requires the use of algorithms that work effectively with machine words and such that the main work can be done in the processor cache, that is, the time to data overwrite less. It is important to note that the size of the r...
ver más
|
|
|
|
|
|
|
Felipe C. Farias, Teresa B. Ludermir and Carmelo J. A. Bastos-Filho
In this paper we propose a procedure to enable the training of several independent Multilayer Perceptron Neural Networks with a different number of neurons and activation functions in parallel (ParallelMLPs) by exploring the principle of locality and par...
ver más
|
|
|
|
|
|
|
Mikhail S. Malovichko,Nikolay E. Khokhlov,Nikolay B. Yavich,Michael S. Zhdanov
Pág. 74 - 78
We present a parallel algorithm for solution of the three-dimensional Helmholtz equation in the frequency domain by the method of volume integral equations. The algorithm is applied to seismic forward modeling. The method of integral equations reduces th...
ver más
|
|
|
|
|
|
|
Thaha Muhammed, Rashid Mehmood, Aiiad Albeshri and Iyad Katib
Sparse matrix-vector (SpMV) multiplication is a vital building block for numerous scientific and engineering applications. This paper proposes SURAA (translates to speed in arabic), a novel method for SpMV computations on graphics processing units (GPUs)...
ver más
|
|
|
|
|
|
|
Fiza Zafar, Alicia Cordero, Husna Maryam and Juan R. Torregrosa
Power flow problems can be solved in a variety of ways by using the Newton?Raphson approach. The nonlinear power flow equations depend upon voltages Vi" role="presentation">|????|Vi
V
i
and phase angle δ" role="presentation">??d
d
. An electri...
ver más
|
|
|
|
|
|
|
Daniel Gibney and Sharma V. Thankachan
Finding substrings of a text T that match a regular expression p is a fundamental problem. Despite being the subject of extensive research, no solution with a time complexity significantly better than ??(|??||??|)
O
(
|
T
|
|
p
|
)
has been found. Backu...
ver más
|
|
|
|
|
|
|
Xuerui Zheng, Jiping Jin, Yajun Wang, Min Yuan and Sheng Qiang
With the development of engineering technology, engineering has higher requirements for the accuracy and the scale of simulation calculation. The computational efficiency of traditional serial programs cannot meet the requirements of engineering. Therefo...
ver más
|
|
|
|
|
|
|
Kunal Banerjee,Evangelos Georganas,Dhiraj D. Kalamkar,Barukh Ziv,Eden Segal,Cristina Anderson,Alexander Heinecke
Pág. 64 - 85
Recurrent neural network (RNN) models have been found to be well suited for processing temporal data. In this work, we present an optimized implementation of vanilla RNN cell and its two popular variants: LSTM and GRU for Intel Xeon architecture. Typical...
ver más
|
|
|
|
|
|
|
A.Yu Gorchakov,V.U. Malkova
Pág. 12 - 17
In this paper, a comparative analysis of four types of processors is performed using the example of the problem of restoring the initial data for the transport equation. The problem is solved by the Levenberg-Marquardt method, which is decomposed into fo...
ver más
|
|
|
|
|
|
|
Mohammed Mahmoud, Mark Hoffmann and Hassan Reza
Sparse matrix-vector multiplication (SpMV) can be used to solve diverse-scaled linear systems and eigenvalue problems that exist in numerous, and varying scientific applications. One of the scientific applications that SpMV is involved in is known as Con...
ver más
|
|
|
|
|
|
|
Rehab Aljabri and Michael H. Meylan
A method is presented to calculate the vibrations of an ice shelf floating in shallow water under different boundary conditions. One condition is that there is no flux, which reduces all calculations and the other is that there is no pressure at the seaw...
ver más
|
|
|
|
|
|
|
Fatima Zahra Guerrouj, Sergio Rodríguez Flórez, Mohamed Abouzahir, Abdelhafid El Ouardi and Mustapha Ramzi
Convolutional Neural Networks (CNNs) have been incredibly effective for object detection tasks. YOLOv4 is a state-of-the-art object detection algorithm designed for embedded systems. It is based on YOLOv3 and has improved accuracy, speed, and robustness....
ver más
|
|
|
|
|
|
|
Trishala Chauhan, Shilpa Sindhu and Rahul S. Mor
The spike in internet users led healthcare companies to confer their agile presence on various digital platforms and engage customers online to increase their viability amid the rising competition. Online customer engagement takes place through branded c...
ver más
|
|
|
|
|
|
|
Andreas Pramudya, Andreas Wibowo
The Government of Indonesia implemented the Build, Operate, and Transfer (BOT) model, relying on private investment to bridge the financing gap in developing public infrastructure facilities, including toll roads. Toll road investments, like other greenf...
ver más
|
|
|
|
|
|
|
Andreas Pramudya, Andreas Wibowo
The Government of Indonesia implemented the Build, Operate, and Transfer (BOT) model, relying on private investment to bridge the financing gap in developing public infrastructure facilities, including toll roads. Toll road investments, like other greenf...
ver más
|
|
|
|