Analiza uporabe in postavitve podatkovnega jezeraMarcel Koren
, 2021, master's thesis
Abstract: Velepodatki in podatkovna jezera sta pojma, ki jih v zadnjih letih vedno pogosteje uporabljamo v povezavi s porastom količine ustvarjenih podatkov. V magistrskem delu predstavljamo lastnosti podatkovnih jezer, čemu so namenjena, kako jih lahko vzpostavimo ter kako so povezana z velepodatki. Podrobno opišemo odprtokodno rešitev Apache Hadoop in oblačno rešitev Microsoft Azure Data Lake. Pri tem smo spoznali tudi orodja, ki jih rešitvi ponujata, med katerimi sta pomembnejši Apache Spark in Azure Databricks. V nadaljevanju predstavljamo, kako ju vzpostavimo ter izvedemo eksperiment, kjer na podlagi hitrosti izvajanja in stroškov spoznamo njune prednosti in slabosti.
Keywords: velepodatki, podatkovna jezera, Hadoop, Spark, Azure Data Lake
Published: 16.12.2021; Views: 94; Downloads: 14
Full text (2,31 MB)
A Comparison of Traditional and Modern Data Warehouse ArchitecturesRok Virant
, 2021, undergraduate thesis
Abstract: Data has never been as desired or valued as it is today. The value of data and information over the past decade has not only changed trends in business and the IT industry but has also changed the dynamic of work. Enormous amounts of aggregate data offer companies and other corporations the option to explore and study data samples. Data collection and information processing are new dynamic factors, not only for individuals but also for corporations. Companies and corporations who are able to process large amounts of data in the shortest possible time can place themselves in a leading position in certain professions. In this bachelor’s thesis we will describe the basic concepts and factors that have shaped new, cloud-based data warehouse technologies. At the same time, we also emphasize why and how these technologies are used. We focus on how the changing technology influenced the users and their consumption of data, the changing dynamics of work as well as the changes of data itself. In the practical part, we created two DWH environments (on-premises and cloud) that we compare with each other. In the experiment, we underlined the fact that CDWHs are in certain situations not always faster than TDWH.
Keywords: Data Warehouses, Cloud Computing, Outsourcing, Data, Information
Published: 18.10.2021; Views: 138; Downloads: 16
Full text (3,58 MB)
Crosswalk of most used metadata schemes and guidelines for metadata interoperability (Version 1.0)Milan Ojsteršek
, 2021, complete scientific database or corpus
Abstract: This resource provides crosswalks among the most commonly used metadata schemes and guidelines to describe digital objects in Open Science, including:
- RDA metadata IG recommendation of the metadata element set,
- EOSC Pilot - EDMI metadata set,
- Dublin CORE Metadata Terms,
- Datacite 4.3 metadata schema,
- DCAT 2.0 metadata schema and DCAT 2.0 application profile,
- EUDAT B2Find metadata recommendation,
- OpenAIRE Guidelines for Data Archives,
- OpenAire Guidelines for literature repositories 4.0,
- OpenAIRE Guidelines for Other Research Products,
- OpenAIRE Guidelines for Software Repository Managers,
- OpenAIRE Guidelines for CRIS Managers,
- Crossref 4.4.2 metadata XML schema,
- Harvard Dataverse metadata schema,
- DDI Codebook 2.5 metadata XML schema,
- Europeana EDM metadata schema,
- The PROV Ontology.
Keywords: crosswalk, metadata, EDMI metadata set, Dublin CORE, Datacite 4.3 metadata schema, DCAT 2.0 metadata schema, UDAT B2Find metadata recommendation, OpenAIRE Guidelines for Data Archives, OpenAire Guidelines for literature repositories 4.0, OpenAIRE Guidelines for Other Research Products, OpenAIRE Guidelines for Software Repository Managers, OpenAIRE Guidelines for CRIS Managers, Crossref 4.4.2 metadata XML schema, Harvard Dataverse metadata schema, DDI Codebook 2.5 metadata XML schema, Europeana EDM metadata schema, Schema.org, Bioschemas, The PROV Ontology
Published: 21.09.2021; Views: 229; Downloads: 11
Research data (169,58 KB)
This document has many files! More...
Proceedings of the 2021 7th Student Computer Science Research Conference (StuCoSReC)
Abstract: The 7th Student Computer Science Research Conference is an answer to the fact that modern PhD and already Master level Computer Science programs foster early research activity among the students. The prime goal of the conference is to become a place for students to present their research work and hence further encourage students for an early research. Besides the conference also wants to establish an environment where students from different institutions meet, let know each other, exchange the ideas, and nonetheless make friends and research colleagues. At last but not least, the conference is also meant to be meeting place for students with senior researchers from institutions others than their own.
Keywords: student conference, computer and information science, artificial intelligence, data science, data mining
Published: 13.09.2021; Views: 216; Downloads: 17
Full text (11,87 MB)
This document has many files! More...
Implementation of a new reporting process in a group xSara Črešnik
, 2021, master's thesis
Abstract: Reporting is present in every company. Whether it is small or big, it cannot be avoided. It plays a crucial role in the process and progress of business. The quality of reporting affects the development of the work environment and the company. Since business report is a document that contains business information, which supports the decisions about the future-oriented business decisions, it is very important for it to be designed in such a way that it contains the key information for the recipient and provides support for business decisions. The reporting process can take place horizontally upwards or downwards. Content and structure vary depending on the recipient of the report. We live in an age when our every step is accompanied by digitization, computerization, artificial intelligence, mass data, the Internet of Things, machine learning, and robotics. These changes have also affected the reporting process as well as its processes. The processes of data acquisition, processing and sharing have changed. Furthermore, the data quantity has increased, whereas the speed of the time in which to prepare the reports has decreased. We can have data without information, but we cannot have information without data. There is never enough time, especially nowadays when we are used to having everything at our fingertips. These are two conflicting factors – having more data and less time to prepare quality reports. The systems are developed to optimize the process, increase efficiency and quality and, what is nowadays most important, they have been created to obtain mass data in the shortest possible time. Therefore, it is important to adapt and implement software that can help achieve our daily tasks. We must know how to process huge amounts of real-time data and deliver the information they contain. It is crucial for companies to keep up with the environment and implement changes and innovations into their business process. A company is like a living organism for it must constantly evolve and grow. As soon as it stops growing and evolving, it can fail because it starts lagging and is therefore no longer competitive to others. To deliver faster feedback, companies need data of better quality. There are tools that can improve the business process, better facilitating the capacity of the human agents. The goal is to harness the employees’ full potential and knowledge for important tasks, such as analyzing, reviewing, and understanding data and acting upon them, invoking information technology to automate repetitive processes and facilitate better communication.
The focus in this master’s thesis is on the reporting process in Group X. Group X is one of the world leaders in the automotive industry, a multinational corporation based in Canada with subsidiaries around the world. The complexity of the business reporting that is implemented for the Headquarters in Canada has to address the complexity of the multinational corporation to support the decision process.
The aim of the thesis is to propose a reporting process for preparing and producing reports with a huge amount of data in a very time-efficient manner. We start by examining the existing processes and upon that, identifying the processes required for the reports to reach the final recipients. Our goal is to identify the toolset, which would increase efficiency, accuracy, credibility, and reduce errors in the fastest possible time. We investigate a short-term and a long-term solution. By a short-term solution, we mean a system, program, or a tool that can help us increase our potential by using digital resources, which are already existing in the organization. By a long-term solution, we mean a solution, which requires employment of specialized future tools in the field of reporting and in repetitive processes, which we can identify with current knowledge and expectations for development. This includes machine learning, robotic process automatization, artificial intelligence.
Keywords: Consolidated reporting, reporting process, robotic process automatization, business intelligence, artificial intelligence, machine learning, SharePoint, Big Data, digital transformation, electronic data interchange.
Published: 01.09.2021; Views: 144; Downloads: 3
Full text (1,71 MB)
Trendi digitalizacije v podjetju-industrija 4.0Sara Vaupotič
, 2020, undergraduate thesis
Abstract: Sam začetek industrijske revolucije sega v začetke druge polovice 18.stoletja z začetkom parne lokomotive ter strojev za predenje. Skozi zgodovino so ljudje vedno iskali izboljšave ter delali s tem, kar so imeli na voljo. Od samih začetkov parnih strojev, odkritja električne energije, prvih telegramov in telefonov, avtomobilov, letal, vse do razvoja digitalne tehnologije, poslovne programske opreme, razvoja prvih računalnikov ter superračunalnikov, razvoja komunikacijske tehnologije, prvih prenosnikov ter industrijske robotike, pa se trenutno razvija četrta industrijska revolucija, znana kot industrija 4.0. Razvija se v smeri digitalizacije in avtomatizacije, pametnih tovarn in naprav, povezanih med seboj (angl. Internet of Things), sistemov za shranjevanje velikih količin informacij ter podatkov (angl. Big Data) in proizvodnih zmogljivosti, ki lahko podatke shranjujejo samostojno kadarkoli in brez človeške prisotnosti. Tako proizvodnja, kot poslovanje potekata v veliki meri digitalno. Digitalizacija, ki je že zamenjala nekoč tradicionalno kulturo v podjetju, vso papirno hrambo so že nadomestile različne računalniške rešitve ter sistemi za lažje, bolj pregledno, brez papirno ter hitrejšo poslovanje (DMS). Za hitrejši dostop do podatkov, lažje in bolj pregledno poslovanje ter zbranost podatkov na enem mestu pa skrbijo sistemi za načrtovanje in pregledno planiranje virov podjetja (ERP), ki se lahko povezujejo tudi s sistemi za upravljanje proizvodnje (MES). Prav zaradi potrebe po izboljšavah, večji učinkovitosti, lažjim pregledom nad stroški in logistiko pa je nastala četrta industrijska revolucija. Kot dober primer podjetja, ki smernice četrte industrijske revolucije že v večji meri upošteva, pa bomo predstavili tehnološko podjetje (Xiaomi), ki trenutno zaseda četrto mesto na trgu pametnih telefonov.
Keywords: industrijske revolucije, industrija 4.0, IoT, Big Data, ERP, MES, DMS, elektronsko poslovanje, Xiaomi.
Published: 23.11.2020; Views: 452; Downloads: 120
Full text (767,12 KB)
The Impact of the EU General Data Protection Regulation (GDPR) on mobile devicesDomen Hribar
, Miha Dvojmoč
, Blaž Markelj
, 2018, original scientific article
The purpose of this paper is to examine novelties introduced by the European Regulation (2016/679) on the Protection of Natural Persons with Regard to the Processing of Personal Data (GDPR) and its key impacts on mobile device users. The paper also presents some of the main changes affecting both natural persons and legal entities. Further, certain issues that might occur while implementing the Regulation are raised together with the degree of individuals’ awareness of the need to protect the personal data stored on their mobile devices.
For the purpose of this paper, we reviewed the legislation, Slovenian and international literature, brochures and media stories in the field of personal data protection. We also used a questionnaire to determine the degree of awareness of the importance of protecting personal data among the general population.
The findings show that no revolutionary changes are introduced. Nevertheless, quite a few novelties concern data controllers and processors. In particular, penalties for breaching the GDPR are now much higher. Individuals’ rights are strengthened and easier to control. In contrast, data controllers and processors are subject to more stringent duties and legal obligations. These changes also apply to mobile device users. The research findings show that individuals are relatively well aware of the concept of personal data; however, the scope of their knowledge shrinks as this concept becomes increasingly complex. Familiarity with the new Regulation (2016/679) having been introduced at the EU level was claimed by 55% of the respondents (N = 195).
Research Limitations / Implications:
The limitations stem from the selective choice of the GDPR’s impact on mobile device users. More important influences are emphasised.
The findings will help both individuals and legal entities understand the changes brought to the area of data protection and tackle them more successfully.
Keywords: personal data protection, GDPR, Personal Data Protection Act, mobile devices
Published: 20.04.2020; Views: 424; Downloads: 26
Full text (366,32 KB)
This document has many files! More...
Blockchain-Based digital identity managementNejc Schneider
, 2019, master's thesis
Abstract: Centralized identity management systems are most commonly used. However, they pose certain threats to identity owners using them. Most of these problems revolve around users not being in control of their data. Self-sovereign identity seeks to aid this issue by putting the user in control. The research goal is to find out whether decentralized networks implementation in the domain of identity management can prove beneficial for the identity owners. This research addresses the following questions:
• How is the user data managed on centralized servers and how can it be managed in a decentralized environment?
• What are the blockchain features that could benefit the domain of digital identity management?
• Can the blockchain-based solutions fulfill the requirements of SSI as well as GDPR regulations?
• What new challenges do they bring and why has a massive adaptation of blockchain technology beyond the field of cryptocurrencies not yet occurred?
Previous academic research and industry reports show the massive potential of Blockchain technology. However, there are only a few examples of how to implement this technology to overcome the barriers (Wadhwa, 2019). Blockchain offers immutability, which can be very valuable for certain purposes. However, it may not be an ideal solution for storing personal data. To offer a comprehensive solution, our use-case combines both centralized and decentralized data storage to protect the user data.
Keywords: blockchain, self-sovereign identity, management systems, identity data, Sovrin network.
Published: 25.11.2019; Views: 682; Downloads: 236
Full text (1,03 MB)
General Guidelines for Academic IntegrityLoreta Tauginienė
, Milan Ojsteršek
, Tomáš Foltýnek
, Franca Marino
, Marco Consentino
, Inga Gaižauskaitė
, Irene Glendinning
, Shiva Sivasubramaniam
, Salim Razi
, Laura Ribeiro
, Tatjana Odiņeca
, Oliver Trevisiol
, 2018, scientific monograph
Abstract: These general guidelines serve as a supportive document for the glossary for academic integrity (sub-output 3G) that describes the definitions of terms related to academic integrity. They serve to help build common understandings of integrity issues in both academia and business. The guidelines outline minimum requirements and responsibilities of various stakeholders in academia. Many of the guidelines are necessarily general, but, where relevant, we provide country-specific examples as well as adjusting to meet to the needs of different fields of study/research.
The general guidelines are addressed to a full range of stakeholders, including national ombudspersons, judicial authorities, compliance officers, research project managers and other related bodies or units in academia (such as policy units, educators/instructors, senior administrator/managers/coordinators) as well as students, the business sector and others.
Keywords: guidelines, academic integrity, academic writing, academic publishing, ethics, plagiarism, copyright, data fabrication, data falsification, cheating
Published: 15.10.2019; Views: 992; Downloads: 51
Full text (1001,05 KB)
This document has many files! More...
; This document is also a collection of 2 documents!