Streamlit and a Bert implementation for contract analysis and clause extraction.
Continuous Integration and Continuous Delivery applied to a Data Science project.
Introducciont to bash scripting to perform Data Science tasks.
A Step-By-Step Approach to Understand TextBlob, NLTK, Scikit-Learn, and LSTM networks applied to Sentiment Analysis.
This project is designed to create NLP as a service with code base for both front end GUI (streamlit) and backend server (FastApi) the usage of transformers models on various downstream NLP task.
I used spaCy package to identify the entities on a body of text. The functions available in spaCy are Token and lemmas, Name Entity Recognition, Sentiment Analysis and Summarization.
I used Docker containers to develop an Analytics solution in IoT able to run closed to the origin of the data. I used Lambda architecture to differentiate hot data from historical data. The container has built-in visualizations and a dashboarding software, this allows exploration of past data and visualization of new data. Also, Node-Red is included and has the function of ETL Extraction Transformation and Load and an MQTT broker.
Explore end-to-end the data mining process using CRISP-DM to analyses and reveals insights from sales transactions. CRISP-DM stands for the cross-industry process for data mining. This methodology provides a structured approach to planning a data mining project. Is included a container available with the data and the Jupyter Notebook.
Series of snippers and learning material developed to follow the workshops presented to Datagroup clients. The snippers contain the SQL statement to call the different algorithms available in SAP HANA.The analytical engine of SAP HANA can perfome Text Processing and Text Analytics on 32 languages.
This notebook focused in Named Entity Recognition (NER) is an application of Natural Language Processing (NLP) that processes and understands large amounts of unstructured human language. Also known as entity identification, entity chunking and entity extraction.
Student project I lead with The University Ulm and Datagroup GmbH. The solution is a container with tools and a dashboard to perform sentimental analysis on tweeter data. The typical use case is brand sentiment on social networks.
This repository contains a Jupyter notebook with EDA exploratory data analysis and a predictive model for maintenance. This is especially useful for an IIoT scenario.
Repository with visualization examples using the Plotly library. This is a good library for interactive dashboards and works very well with Flask in Webapps.