LLMs have no DELETE button. There is no straightforward mechanism to “unlearn” specific information, no equivalent to deleting a row in your database user table. In a world where “right to be forgotten” is central to many privacy regulations, using LLMs presents some difficult challenges.
The Data Privacy Vault is IEEE’s recommended architecture for securely storing, managing, and utilizing customers’ sensitive Personally Identifiable Information (PII).
Data of a sensitive nature can seep into an LLM during training as well as inference. During training, sensitive information may be ingested from documents that have not been anonymized or redacted. During inference, a prompt may inadvertently provide sensitive information – for example, a prompt that asks the LLM to summarize a will containing sensitive details.
The only way to delete information from an LLM is to train it from scratch! Hence, don’t let sensitive information get in in the first place.
A key consideration for anonymization is referential integrity: each piece of synthetic data must consistently stand in for the same piece of original sensitive information (Synthetic Data ⇄ Original Sensitive Information).
Is a private LLM a solution, as opposed to a managed service like OpenAI’s ChatGPT? Perhaps, but who will update the base model to keep up with new releases? It is expensive!
Tokenization: Swap sensitive data for tokens. A token is a reference to some sensitive data stored somewhere else; it lets us reference the data while providing obfuscation. Any sensitive data ingested via the application frontend, including Personally Identifiable Information (PII), is replaced by tokens generated by the Data Privacy Vault. For example: 777-123-4567 → ABC4567.

Fig. shows sensitive data being replaced by tokens by the application frontend through the Data Privacy Vault. The assets downstream of the app – the app database, warehouse, reports and/or analytics – then only “know” the tokenized data. These are not tokens in the sense of tokenization in LLMs, but tokens that hold a reference to the original data, which is stored in the Data Privacy Vault. The vault not only stores and generates de-identified data, it tightly controls access to sensitive data through a zero-trust model, where user accounts are managed through explicit access control policies.
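To make the idea concrete, here is a minimal sketch of tokenization in Python. The ToyVault class and its method names are hypothetical stand-ins; a production Data Privacy Vault adds encryption, token-uniqueness guarantees, access policies, and audit logging:

import secrets

class ToyVault:
    """Illustrative in-memory vault mapping tokens to sensitive values."""
    def __init__(self):
        self._store = {}  # token -> original sensitive value

    def tokenize(self, value: str) -> str:
        # Keep the last four digits so the token remains useful downstream;
        # a real vault guarantees each token is unique.
        suffix = value[-4:]
        token = "ABC" + suffix if suffix.isdigit() else "TOK" + secrets.token_hex(4)
        self._store[token] = value
        return token

    def detokenize(self, token: str) -> str:
        # In a real vault, this call is gated by zero-trust access policies.
        return self._store[token]

vault = ToyVault()
token = vault.tokenize("777-123-4567")
print(token)                    # ABC4567 -- safe to store downstream
print(vault.detokenize(token))  # 777-123-4567 -- authorized callers only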
Fig. shows sensitive data under explicit access control to address WHO sees WHAT.
WHO sees WHAT? The team that has access to the Data Privacy Vault is verifiably in-scope of Identity & Access Management (IAM). Sensitive information can be redacted according to subscriber roles.
Using privacy enhancing techniques such as polymorphic encryption and tokenization, sensitive data can be de-identified in a way that preserves referential integrity.
Prompt Seepage: Sensitive data may also enter a model during inference. For example, a prompt is created asking for a summary of a will. The vault detects the sensitive information, de-identifies it, and shares a non-sensitive version of the prompt with the LLM. Since the LLM was trained on non-sensitive and de-identified data, inference can be carried out as normal.
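As a sketch of prompt de-identification, the snippet below redacts phone-like numbers from a prompt before it reaches the LLM, reusing the ToyVault from the sketch above. A real vault would detect many more PII categories (names, addresses, account numbers), typically with trained detection models rather than a single regex:

import re

PHONE_PATTERN = re.compile(r"\b\d{3}-\d{3}-\d{4}\b")

def deidentify_prompt(prompt: str, vault: ToyVault) -> str:
    # Replace each detected number with a vault-issued token.
    return PHONE_PATTERN.sub(lambda m: vault.tokenize(m.group()), prompt)

safe_prompt = deidentify_prompt(
    "Summarize this will. The executor can be reached at 777-123-4567.", vault)
print(safe_prompt)  # the LLM only ever sees the tokenized prompt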
Data Privacy Vault Architecture
Fig. shows the flow of information in a Data Privacy Vault architecture.
🚀 Dive into the cutting-edge world of Artificial Intelligence with my hands-on class using FastAI! In this immersive learning experience, you’ll not only grasp the fundamentals of AI but also explore contemporary challenges and solutions, including the privacy and compliance issues associated with powerful tools like large language models (LLMs). Get hands-on experience with state-of-the-art techniques while unraveling the complexities of generative AI. Join me on this exciting journey to master FastAI and gain insights into the latest advancements in AI technology. Don’t just follow the AI wave—ride it with confidence in my dynamic and practical AI class! 🤖💡 #AI #FastAI #HandsOnLearning #TechInnovation
They share a common origin – each one was generated by a Deep Learning model. Intrigued to understand how? Large Language Models (LLMs) are multifaceted, handling complex tasks such as sentence completion, Q&A, text summarization, and sentiment analysis. LLMs are, as the name emphasizes, substantial: intricate models with tens or hundreds of billions of parameters, honed on vast datasets totaling 10 terabytes. However, it is possible to appreciate the foundation of how machines learn meaning from text starting from a seemingly straightforward concept – the bigram model.
The bigram model operates on the principle of predicting one token from another. For simplicity, let’s consider tokens as characters in the English alphabet. This principle closely aligns with the essence of LLMs like ChatGPT, which predict subsequent tokens based on preceding ones, iteratively generating coherent text and even entire computer programs. In our bigram model, however, we predict the next character from the current one, utilizing a 26×26 matrix of probabilities. Each entry in the matrix represents the probability of a particular character appearing after another. This matrix, with some modifications, constitutes our model. Our goal? To generate names.
Bigram Matrix
The bigram matrix shows the frequency of occurrence of one token following another (“bigram”) in a given dataset. The tokens here are characters of the English alphabet plus one additional token to mark the start or end of a word. The dataset is a collection of 30,000+ names from a public database. The entry in a cell is the count of occurrences of the character in the column following the character in the row.
We introduce an extra character to mark the start or end of a word, expanding from a 26×26 matrix to a 27×27 matrix. The matrix entries arise from patterns observed in a training dataset comprising over 30,000 names from a public database. Raw occurrence counts shown are transformed into probabilities for sampling. Generating a name involves starting with the character that marks the start of a word, sampling the 1st character from the multinomial probability distribution in the 1st row, recycling that character as input to predict the 2nd character, and so forth until reaching the end character. The resulting names, like junide, janasah, p, cony, and a, showcase the model’s unique outputs.
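A sketch of this machinery in PyTorch follows. The three-name list is a stand-in for the 30,000+ name dataset; with the real dataset, outputs like those above emerge:

import torch

names = ["emma", "olivia", "ava"]  # stand-in for the 30,000+ name dataset

chars = ['.'] + [chr(c) for c in range(ord('a'), ord('z') + 1)]  # '.' marks start/end
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}

# Count occurrences of each bigram: row = current character, column = next character.
counts = torch.zeros(27, 27)
for name in names:
    tokens = ['.'] + list(name) + ['.']
    for a, b in zip(tokens, tokens[1:]):
        counts[stoi[a], stoi[b]] += 1

# Turn raw counts into row-wise probability distributions for sampling.
probs = counts / counts.sum(dim=1, keepdim=True)

# Generate one name: start at '.', sample the next character from the current row.
ix, out = 0, []
while True:
    ix = torch.multinomial(probs[ix], num_samples=1).item()
    if ix == 0:  # sampled the end-of-word marker
        break
    out.append(itos[ix])
print(''.join(out))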
Considering these names, one might favor Janasah! But there’s room for enhancement. Enter the neural network! How would this transition occur? Instead of relying on a lookup matrix, the neural network would predict one character from another. Here’s how:
Representation: Numerically represent each character for input and output with vectors of length 27, accounting for the extra character.
Data Sets: Divide the data into training, validation, and testing sets to train the model, guard against overfitting, and assess performance.
Loss Function: Utilize negative log-likelihood, common in such scenarios, calculated through a softmax layer to generate a probability distribution.
Training: Adjust model parameters using calculated gradients and backpropagation through the neural network.
Refer to the Colab notebook for the implementation with detailed notes. So we have trained a neural network to do what we could do with a matrix. What’s the big deal?
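For reference, here is a compact, reconstructed sketch of the steps above (not the notebook’s exact code; the data splits are omitted for brevity), reusing names and stoi from the bigram sketch:

import torch
import torch.nn.functional as F

# Representation: build (current char -> next char) training pairs as indices.
xs, ys = [], []
for name in names:
    tokens = ['.'] + list(name) + ['.']
    for a, b in zip(tokens, tokens[1:]):
        xs.append(stoi[a])
        ys.append(stoi[b])
xs, ys = torch.tensor(xs), torch.tensor(ys)

W = torch.randn(27, 27, requires_grad=True)  # the network's weights

for step in range(100):
    xenc = F.one_hot(xs, num_classes=27).float()            # length-27 input vectors
    logits = xenc @ W
    probs = F.softmax(logits, dim=1)                        # softmax layer
    loss = -probs[torch.arange(len(ys)), ys].log().mean()   # negative log-likelihood
    W.grad = None
    loss.backward()                                         # backpropagation
    with torch.no_grad():
        W -= 50 * W.grad                                    # gradient step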
For one, we can use a longer sequence of characters as input to the neural network, giving the model more material to work with to make better predictions. This block of characters provides not just one sequence, but all sequences including and up to the last character as context to the neural network. This already goes beyond what we can do with matrices with counts of occurrences of bigrams.
But how does a neural network learn meaning in text? Part of the answer lies in embeddings. Every token is converted into a numerical vector of fixed size, thus allowing a spatial representation in which meaningful associations can take shape. We allow the embeddings to emerge as properties of a neural network during the training process. The deeper layers of the neural network use these associations as stepping stones to enrich structure in keeping with the nuances and intricacies of linguistic constructs.
Talk about layered meaning!
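As a taste of embeddings, PyTorch provides a learnable lookup table; here each of our 27 tokens gets a vector of length 10 (an arbitrary choice), trained along with the rest of the network so that related tokens drift toward one another in space:

import torch
import torch.nn as nn

emb = nn.Embedding(num_embeddings=27, embedding_dim=10)  # 27 tokens, 10-dim vectors
vec = emb(torch.tensor([1]))  # embedding for token index 1 ('a' in our scheme)
print(vec.shape)              # torch.Size([1, 10])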
Wrapping up our baby steps in language models, we’ve transitioned from basic bigram models to deep neural networks, exploring the evolution from mechanical predictions to embeddings that allow associations that capture primitives of nuanced linguistic structure. We get a glimpse into the potential of these models to grasp the intricacies of language, beyond generating names. As we take these initial steps, the horizon of possibilities widens, promising not only enhanced language generation but also advancements in diverse applications, hinting at a future where machines engage with human communication in increasingly sophisticated ways.
Explore the fascinating world of Artificial Intelligence in my upcoming class, powered by FastAI! We’ll embark on a hands-on journey through the evolving landscape of AI, building models with state-of-the-art architecture and learning to wield the power of Large Language Models (LLMs). Whether you’re a beginner or seasoned enthusiast, this class promises a dynamic and engaging exploration into the realm of AI, equipping you with the skills to navigate and innovate in this rapidly evolving field. Join me for an exciting learning experience that goes beyond theory, fueled by the practical insights and advancements offered by FastAI.
In the world of app development, we often find ourselves humming to the tune of “Little boxes on the hillside, little boxes made of ticky tacky.” While our digital boxes may lack the charming hues of green, pink, blue, and yellow, they still bear a striking resemblance to their real-world counterparts.
You can find out more about this delightful song by folk singer-songwriter Malvina Reynolds on this wiki.
Little boxes on the hillside, Little boxes made of tickytacky
Little boxes on the hillside, little boxes all the same
There’s a green one and a pink one and a blue one and a yellow one
And they’re all made out of ticky tacky and they all look just the same.
These “ticky tacky” boxes are none other than the trusty rectangular frames called arrangements, the unsung heroes of screen design in App Inventor. Much like their physical counterparts, these arrangements serve as the foundation for organizing the various elements that make up your app’s user interface. It’s a simple concept but one that works wonders.
Within these rectangular frames, elements align themselves neatly, jostling for space from left to right in a horizontal arrangement. If you want to pile them up, just opt for the vertical arrangement. Buttons, labels, images, and other visual components can be neatly arranged using these building blocks.
But what if you prefer a bit of artistic flair? Don’t fret! When you choose not to employ an arrangement, elements will naturally stack up. Moreover, these arrangements can play nice with each other, nestling neatly within one another to create a more complex layout.
Image Viewer
The selected image is displayed in a viewport at the top of the screen.
Image Selector
The buttons are arranged using a HorizontalArrangement from the Layout drawer.
Map With Navigation
The Map component from the Maps drawer is used along with the Navigation component from the same drawer.
Take, for instance, the “Slideshow” app. It employs the default Screen layout, so the image viewer, the row of buttons, and the map component stack vertically. The first two have Height set to Automatic and Width set to Fill parent; they take up just enough height to fit their contents. The last has both Height and Width set to Fill parent, so it takes up the remaining available real estate. We only use a HorizontalArrangement for the buttons, which sit side by side in a row. It’s like a choreographed dance of elements, all thanks to these arrangements.
One of the perks of this layout method is “relative dimensioning.” This means that you can specify the size of elements relative to their containers, all the way up to the outermost container—the screen itself. This ensures a consistent look across devices with different screen sizes, making your app appear polished and professional.
Despite its simplicity, AppInventor empowers creative minds to craft aesthetically pleasing arrangements with complex structures, much like the iconic works of Piet Mondrian. With the versatile use of horizontal and vertical arrangements, you can orchestrate a symphony of colors, shapes, and proportions, evoking the spirit of abstract art within the confines of mobile app design. Your Mondrian-inspired creation in AppInventor showcases the fusion of technology and artistic expression, proving that even the most unassuming tools can spark a touch of genius.
Readers with a background in web development may want to check out Jen Simmons’s blogpost on Mondrian layouts with CSS Grid.
For animations, AppInventor offers a Canvas component on which to arrange ImageSprite components. Find these in the Drawing and Animation drawer. These allow for shaping, positioning, and sizing animated characters and props on a backdrop in line with the requirements of animation. It’s a world of possibilities, but with a boundary—sprites are confined within the canvas perimeter, and only sprites can “live” there. We will delve more into animation with canvas and sprites in another blogpost.
Happy designing!
Do you have a brilliant app idea but feel overwhelmed, thinking it’s a task best left to computer wizards? It may be easier than you think. Our AppInventor course is here to demystify app development and empower you to turn your concepts into reality. Let our hands-on approach guide you through the process. Each week, you’ll design and build real apps, learning key concepts such as layouts discussed in this blogpost. By the end of the course, you’ll be equipped with the skills and confidence to bring your unique ideas to life. Join us, and discover the art of making apps, one step at a time.
Imagine you are eagerly awaiting a package delivery at your doorstep. In this scenario, you have two distinct approaches to staying informed about when your eagerly anticipated package arrives. First, you could repeatedly venture to your doorstep, hoping to catch a glimpse of your precious cargo. Alternatively, you could simply relax and await the familiar sound of the doorbell, which signals the arrival of the deliveryman and your package. The moment the doorbell rings, you spring into action, promptly picking up your long-awaited package.
These two methods of package monitoring are symbolic of different paradigms in the world of programming. The event-driven paradigm, often exemplified by the doorbell scenario, is akin to following a set of instructions as if they were a “mad-lib,” constructed as follows: “WHEN event occurs, DO action.” In this construct, the first part represents the event, while the second part signifies the callback or action to be executed in response to the event. But how does this analogy relate to the realm of Android app development?
In the context of Android app development, this paradigm is of paramount importance. It hinges on the execution of a callback function when a specific event transpires. These events can be intrinsically tied to the actions of the phone’s user, such as tapping the screen or swiping a finger across it. However, user interactions are just one facet of the vast spectrum of events. Another category of events revolves around time, driven by clocks and timers. The callback function springs into action when a predetermined time interval lapses, and the timer signals its completion.
To illustrate this concept, let’s consider an app known as “Mole Mash”. This app employs a timer to move the mole to a random location on the screen. The objective is to tap the mole and score points. In this instance, the event handler implements the logic as follows: WHEN the timer goes off, DO move the mole to a random location. In AppInventor, the green block represents the event handler that sets off the timer at regular intervals. The interval is a configurable property and can be set manually or programmatically. The purple block inside is the callback procedure that moves the mole to a random location. The callback uses a canned procedure for random number generation to set the X and Y coordinates of the mole independently. Thus, the mole comes to life!
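For readers who like to see the construct in text code, here is the same WHEN/DO pattern sketched in Python. The Sprite class and the toy loop are hypothetical stand-ins for AppInventor’s ImageSprite and Clock components:

import random

CANVAS_WIDTH, CANVAS_HEIGHT = 320, 480  # hypothetical canvas dimensions

class Sprite:
    """Minimal stand-in for an AppInventor ImageSprite."""
    def __init__(self):
        self.x, self.y = 0, 0
    def move_to(self, x, y):
        self.x, self.y = x, y

mole = Sprite()

# WHEN the timer goes off, DO move the mole to a random location.
def move_mole():
    mole.move_to(random.randint(0, CANVAS_WIDTH), random.randint(0, CANVAS_HEIGHT))

# Toy event loop standing in for the Clock component firing at each interval.
for tick in range(3):
    move_mole()
    print(f"tick {tick}: mole at ({mole.x}, {mole.y})")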
In essence, event-driven programming in the context of Android app development hinges on the idea of responding to various events, whether initiated by the user’s actions or the passage of time, by executing predefined actions or callbacks. This flexible and dynamic paradigm enables developers to create interactive and responsive applications that cater to a multitude of scenarios and user experiences.
📱 Explore Android App Development with AppInventor! 🤖
Discover the exciting world of Android app development with AppInventor, where you can build practical skills while crafting innovative apps. Our class offers a hands-on approach to learning key software concepts, such as event-driven programming. Throughout the course, students have the opportunity to create a new app every week, gaining valuable experience and insights into the world of app development. Join us on this coding journey and unlock the potential of interactive, responsive applications. Dive into the world of possibilities with AppInventor! 💡👩💻📦
train_model(epochs=30, lr=0.1): This function acts as the outer wrapper of our training process. It requires access to the training data, trainingIn and trainingOut, which should be defined in the environment. train_model orchestrates the training process by calling the execute_epoch function for a specified number of epochs.
execute_epoch(coeffs, lr): Serving as the inner wrapper, this function carries out one complete training epoch. It takes the current coefficients (weights and biases) and a learning rate as input. Within an epoch, it calculates the loss and updates the coefficients. To estimate the loss, it calls calc_loss, which compares the predicted output generated by calc_preds with the target output. After this, execute_epoch performs a backward pass to compute the gradients of the loss, storing these gradients in the grad attribute of each coefficient tensor.
calc_loss(coeffs, indeps, deps): This function calculates the loss using the given coefficients, input predictors indeps, and target output deps. It relies on calc_preds to obtain the predicted output, which is then compared to the target output to compute the loss. The backward pass is subsequently invoked to compute the gradients, which are stored within the grad attribute of the coefficient tensors for further optimization.
calc_preds(coeffs, indeps): Responsible for computing the predicted output based on the given coefficients and input predictors indeps. This function follows the forward pass logic and applies activation functions where necessary to produce the output.
update_coeffs(coeffs, lr): This function plays a pivotal role in updating the coefficients. It iterates through the coefficient tensors, applying gradient descent with the specified learning rate lr. After each update, it resets the gradients to zero using the zero_ function, ensuring the gradients are fresh for the next iteration.
init_coeffs(n_hidden=20): The initialization function is responsible for setting up the initial coefficients. It shapes each coefficient tensor based on the number of neurons specified for the sole hidden layer.
model_accuracy(coeffs): An optional function that evaluates the prediction accuracy on the validation set, providing insights into how well the trained model generalizes to unseen data.
In this blog post, we’ll take a deep dive into constructing a powerful deep learning neural network from the ground up using PyTorch. Building upon the foundations of the previous simple neural network, we’ll refactor some of these functions for deep learning.
Deep Learning
Refactor code for multiple hidden layers
Initializing Weights and Biases
To prepare our neural network for deep learning, we’ve revamped the weight and bias initialization process. The init_coeffs function now allows for specifying the number of neurons in each hidden layer, making it flexible for different network configurations. We generate weight matrices and bias vectors for each layer while ensuring they are equipped to handle the deep learning challenges.
def init_coeffs(hiddens=[10, 10]):
    sizes = [trainingIn.shape[1]] + hiddens + [1]
    n = len(sizes)
    weights = [(torch.rand(sizes[i], sizes[i+1]) - 0.3) / sizes[i+1] * 4 for i in range(n-1)]  # weight initialization
    biases = [(torch.rand(1)[0] - 0.5) * 0.1 for i in range(n-1)]  # bias initialization
    for wt in weights: wt.requires_grad_()
    for bs in biases: bs.requires_grad_()
    return weights, biases
We define the architecture’s structure using sizes, where hiddens specifies the number of neurons in each hidden layer. We ensure that weight and bias initialization is suitable for deep networks.
Forward Propagation With Multiple Hidden Layers
Our revamped calc_preds function accommodates multiple hidden layers in the network. It iterates through the layers, applying weight matrices and biases at each step and introducing non-linearity using the ReLU activation function in the hidden layers and the sigmoid activation in the output layer. This enables our deep learning network to capture complex patterns in the data.
def calc_preds(coeffs, indeps):
    weights, biases = coeffs
    res = indeps
    n = len(weights)
    for i, wt in enumerate(weights):
        res = res @ wt + biases[i]
        if i != n-1:
            res = F.relu(res)  # apply ReLU activation in hidden layers
    return torch.sigmoid(res)  # sigmoid activation in the output layer
Note that weights is now a list of tensors containing layer-wise weights and, correspondingly, biases is the list of tensors containing layer-wise biases.
Backward Propagation With Multiple Hidden Layers
Loss calculation and gradient descent remain consistent with the simple neural network implementation. We use the mean absolute error (MAE) for loss as before and tweak the update_coeffs function to apply gradient descent to update the weights and biases in each hidden layer.
def update_coeffs(coeffs, lr):
    weights, biases = coeffs
    for layer in weights + biases:
        layer.sub_(layer.grad * lr)
        layer.grad.zero_()
Putting It All Together in Wrapper Functions
Our train_model function can be used ‘as is’ to orchestrate the training process, with the execute_epoch wrapper function helping as before. The model_accuracy function also does not change.
Summary
Conclusion and Takeaways
With these modifications, we’ve refactored our simple neural network into a deep learning model that has greater capacity for learning. The beauty of it is we have retained the same set of functions and interfaces that we implemented in a simple neural network, refactoring the code to scale with multiple hidden layers.
train_model(epochs=30, lr=0.1): No change!
execute_epoch(coeffs, lr): No change!
calc_loss(coeffs, indeps, deps): No change!
calc_preds(coeffs, indeps): Tweak to use the set of weights and corresponding set of biases in each hidden layer, iterating over all layers from input to output.
update_coeffs(coeffs, lr): Tweak to iterate over the set of weights and accompanying set of biases in each layer.
init_coeffs(hiddens=[10, 10]): Tweak for compatibility with an architecture that can potentially have any number of hidden layers of any size.
model_accuracy(coeffs): No change!
Such a deep learning model has greater capacity for learning. However, it is also hungrier for training data! In subsequent posts, we will examine the breakthroughs that have made deep learning models practically feasible and reliable. These include advancements such as:
Batch Normalization
Residual Connections
Dropout
Are you eager to dive deeper into the world of deep learning and further enhance your skills? Consider joining our coaching class in deep learning with FastAI. Our class is designed to provide hands-on experience and in-depth knowledge of cutting-edge deep learning techniques. Whether you’re a beginner or an experienced practitioner, we offer tailored guidance to help you master the intricacies of deep learning and empower you to tackle complex projects with confidence. Join us on this exciting journey to unlock the full potential of artificial intelligence and neural networks.
In this blog post, we will walk you through the process of creating a simple neural network from scratch in PyTorch for binary classification. We will implement a neural network with one hidden layer containing multiple neurons followed by a single output neuron. We will also discuss the design choices made for this network, including the use of ReLU activation in the hidden layer and sigmoid activation in the output layer.
Neural Network Architecture
The architecture of our simple neural network can be summarized as follows:
Input Layer
Hidden Layer with n neurons and ReLU activation.
Output Layer with a single neuron and sigmoid activation.
This structure allows us to demonstrate the gradient descent algorithm in PyTorch with multiple iterations of two steps as follows:
Forward-propagate inputs to generate outputs and compute loss
Backward-propagate loss by computing gradients and applying them to update model parameters.
We show how PyTorch uses tensors to parallelize operations for efficiency.
Training Data
It is customary to split the available data into three distinct sets: training, validation, and testing. These sets serve specific roles in the model development process.
Training Data: The training set is the largest portion of the data and is primarily used for training the model. During training, the gradients are computed on this data to update the weights and biases iteratively, allowing the model to learn from the provided examples.
Validation Data: The validation set is essential for assessing the model’s performance during training. It is not used for gradient computation but serves as a means to measure the loss. This monitoring helps prevent overfitting, a scenario where the model memorizes the training data rather than generalizing from it. Adjustments to the model can be made based on the validation loss.
Test Data: The test set is a reserved subset and should be used sparingly. It comes into play only after the model has completed its training phase. It serves the purpose of evaluating the model’s generalization performance on unseen data and reporting the final results. It ensures that the model can make accurate predictions on new, previously unseen examples, thus providing a reliable measure of its effectiveness.
This partitioning strategy allows for rigorous model assessment and ensures that the model’s performance is accurately estimated on data it has not encountered during training or validation. Before running the code, ensure that trainingIn and trainingOut are defined as global variables. These are represented as tables where rows correspond to individual examples, and each column represents a specific field or feature.
trainingIn contains the independent variables and has the shape (#examples x #variables), where #examples is the number of data points or examples in our training dataset and #variables is the number of independent variables or features.
trainingOut contains the dependent variable and has the shape (#examples x 1), where #examples is the same as in trainingIn.
Likewise, we’d want the validationIn and validationOut sets as global variables.
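For concreteness, here is a hypothetical setup with synthetic data that has the shapes described above; substitute your own dataset in practice:

import torch

n_examples, n_vars = 800, 12  # arbitrary sizes for illustration
trainingIn = torch.rand(n_examples, n_vars)                 # (#examples x #variables)
trainingOut = torch.randint(0, 2, (n_examples, 1)).float()  # (#examples x 1), binary labels
validationIn = torch.rand(200, n_vars)
validationOut = torch.randint(0, 2, (200, 1)).float()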
Backpropagation
Apply gradient descent for training
Initializing Weights and Biases
We start by defining the initialization function init_coeffs to set up the initial weights and biases for the neural network. The initialization process includes the following steps:
We divide the weights in the hidden layer by the number of hidden neurons to help with convergence.
We introduce a bias for the output layer.
Note that we set requires_grad on weights and biases during initialization. This is a crucial step, as it informs PyTorch to track and compute gradients for these parameters during the subsequent forward and backward passes. When the loss is calculated as a function of weights and biases, PyTorch automatically computes the gradients of the loss with respect to these parameters and stores them for gradient descent optimization.
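A sketch of init_coeffs consistent with these steps might look as follows; the exact scaling constants are illustrative:

import torch

def init_coeffs(n_hidden=20):
    # Hidden-layer weights, scaled down by the number of hidden neurons
    layer1 = (torch.rand(trainingIn.shape[1], n_hidden) - 0.5) / n_hidden
    # Output-layer weights plus a single bias for the output layer
    layer2 = torch.rand(n_hidden, 1) - 0.3
    const = torch.rand(1)[0]
    # requires_grad_ tells PyTorch to track gradients for these tensors
    return layer1.requires_grad_(), layer2.requires_grad_(), const.requires_grad_()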
Forward Pass
Next, we define the function calc_preds to perform the forward pass of the neural network:
We use the sigmoid activation in the output layer.
The use of non-linearity is key. Without it, stacked linear layers are equivalent to a single layer. More importantly, the superposition of non-linearities is what gives the neural network the property of being a universal function approximator. We have chosen ReLU for the hidden layer and sigmoid for the output layer, enabling the interpretation of the output as a likelihood score.
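Putting these choices together, a sketch of calc_preds consistent with the description:

import torch
import torch.nn.functional as F

def calc_preds(coeffs, indeps):
    layer1, layer2, const = coeffs
    res = F.relu(indeps @ layer1)               # ReLU non-linearity in the hidden layer
    return torch.sigmoid(res @ layer2 + const)  # sigmoid output, readable as a likelihood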
Loss Calculation
We calculate the loss using the mean absolute error (MAE) in the calc_loss function:
def calc_loss(coeffs, indeps, deps):
    predictions = calc_preds(coeffs, indeps)
    loss = torch.abs(predictions - deps).mean()
    return loss
Notice that the loss is a function of the weights and biases. By setting requires_grad on these parameters, we inform PyTorch that we are interested in computing the gradients of the loss with respect to these parameters for the purpose of optimization.
Training the Model
To train the neural network, we define the training process using the train_model function:
def train_model(epochs=30, lr=0.1):
    torch.manual_seed(442)
    coeffs = init_coeffs()
    for i in range(epochs):
        execute_epoch(coeffs=coeffs, lr=lr)
    return coeffs
The train_model function:
Initializes the coefficients.
Iterates through a specified number of epochs.
Calls execute_epoch for each epoch to update the coefficients.
Executing an Epoch
The execute_epoch function calculates the loss using calc_loss and applies the gradients using update_coeffs as follows:
def execute_epoch(coeffs, lr):
    loss = calc_loss(coeffs, trainingIn, trainingOut)
    loss.backward()
    with torch.no_grad():
        update_coeffs(coeffs, lr)
    print(f'{loss:.3f}', end='; ')
When we call backward on the loss, PyTorch automatically calculates gradients for all the parameters that contribute to the loss and have requires_grad set. These gradients are stored with the respective parameters and can be accessed using the grad attribute.
Updating Coefficients
The update_coeffs function is used to update the coefficients using gradient descent as follows:
def update_coeffs(coeffs, lr):
    for layer in coeffs:
        layer.sub_(layer.grad * lr)
        layer.grad.zero_()
Note that PyTorch accumulates gradients unless they are reset to zero between successive steps. That is why we call zero_ once the gradients have been used to update the weights and biases.
Running the Training
Finally, we run the training with different learning rates and for varying numbers of epochs:
coeffs = train_model(lr=1.4) # Example 1
coeffs = train_model(lr=20) # Example 2
coeffs = train_model(epochs=100, lr=10) # Example 3
You can observe how the loss changes during training and evaluate the model’s accuracy based on your dataset.
Model Accuracy
Optionally, we can implement a function model_accuracy(coeffs), to evaluate the accuracy of the trained model on the validation dataset.
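One possible implementation, assuming validationIn and validationOut are defined as described earlier, thresholds the sigmoid output at 0.5:

def model_accuracy(coeffs):
    with torch.no_grad():
        preds = calc_preds(coeffs, validationIn)
    # Compare thresholded predictions with the binary labels
    return ((preds > 0.5).float() == validationOut).float().mean()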
That’s it! We now have a simple neural network implemented from scratch in PyTorch for binary classification. We can customize the architecture, hyperparameters, and activation functions to suit our specific problem.
Summary
Conclusion and Takeaways
We split the dataset into subsets for training and validation. We then wrote a series of functions to parcel out the code for each step in the training process, culminating in the train_model() wrapper that requires data cuts trainingIn and trainingOut in the environment. The steps are as follows:
train_model(epochs=30, lr=0.1): This function acts as the outer wrapper of our training process. It requires access to the training data, trainingIn and trainingOut, which should be defined in the environment. train_model orchestrates the training process by calling the execute_epoch function for a specified number of epochs.
execute_epoch(coeffs, lr): Serving as the inner wrapper, this function carries out one complete training epoch. It takes the current coefficients (weights and biases) and a learning rate as input. Within an epoch, it calculates the loss and updates the coefficients. To estimate the loss, it calls calc_loss, which compares the predicted output generated by calc_preds with the target output. After this, execute_epoch performs a backward pass to compute the gradients of the loss, storing these gradients in the grad attribute of each coefficient tensor.
calc_loss(coeffs, indeps, deps): This function calculates the loss using the given coefficients, input predictors indeps, and target output deps. It relies on calc_preds to obtain the predicted output, which is then compared to the target output to compute the loss. The backward pass is subsequently invoked to compute the gradients, which are stored within the grad attribute of the coefficient tensors for further optimization.
calc_preds(coeffs, indeps): Responsible for computing the predicted output based on the given coefficients and input predictors indeps. This function follows the forward pass logic and applies activation functions where necessary to produce the output.
update_coeffs(coeffs, lr): This function plays a pivotal role in updating the coefficients. It iterates through the coefficient tensors, applying gradient descent with the specified learning rate lr. After each update, it resets the gradients to zero using the zero_ function, ensuring the gradients are fresh for the next iteration.
init_coeffs(n_hidden=20): The initialization function is responsible for setting up the initial coefficients. It shapes each coefficient tensor based on the number of neurons specified for the sole hidden layer.
model_accuracy(coeffs): An optional function that evaluates the prediction accuracy on the validation set, providing insights into how well the trained model generalizes to unseen data.
While we have demonstrated gradient descent with a simple neural network, we can extend this implementation to a deep learning model by adding more hidden layers. All we need to do is refactor the code, keeping the same set of six functions and their interfaces. Following the approach presented here, we can create a versatile and scalable neural network architecture tailored to specific requirements.
Happy coding!
Are you eager to dive deeper into the world of deep learning and further enhance your skills? Consider joining our coaching class in deep learning with FastAI. Our class is designed to provide hands-on experience and in-depth knowledge of cutting-edge deep learning techniques. Whether you’re a beginner or an experienced practitioner, we offer tailored guidance to help you master the intricacies of deep learning and empower you to tackle complex projects with confidence. Join us on this exciting journey to unlock the full potential of artificial intelligence and neural networks.
Have you ever dreamed of creating your own amazing app but felt overwhelmed by the complexities of traditional app development? Meet AppInventor, the revolutionary platform developed by MIT and Google that empowers everyone, regardless of their technical background, to build incredible apps with ease. With over 18 million users spanning all ages and backgrounds, AppInventor has been used to craft more than 85 million apps. All you need to get started is a laptop with a web browser like Chrome and an Android phone to turn your ideas into reality!
What Can You Create with AppInventor?
The possibilities with AppInventor are endless. Here’s a glimpse of the innovative apps that creators have brought to life:
Show-and-Tell: Craft interactive apps that seamlessly blend images, audio, or video to share knowledge. Whether it’s introducing people to local art pieces or aiding in meditation practices, you can make learning engaging and fun.
Animated Games: Dive into the world of classic games like Space Invaders, Mole Mash, or Pong. Build games that captivate and challenge users with animated characters, touch gestures, and sensor interactions.
Maps and Navigation: Develop location-based apps that provide directions and guide users to local attractions, ensuring they never miss out on exciting experiences.
Artificial Intelligence: Explore the realm of chatbots and computer vision to create intelligent applications that understand and interact with users in a human-like manner.
Building the App Screen
AppInventor boasts a user-friendly drag-and-drop interface that simplifies app creation. Begin by crafting the app’s screen, effortlessly placing graphical elements like images, buttons, sliders, and spinners. Organize these components using layouts to maximize screen real estate. For drawing and animation, leverage the canvas element, which responds to touch gestures, and populate it with animated characters or sprites for interactive games. Beyond visible elements, AppInventor lets you integrate various functionalities, such as sensors, social media connectors, databases, and even AI components.
Building the App Logic
Modern apps rely on event-driven architecture, where actions are triggered by events like user gestures, sensor data, or timers. With AppInventor, you build these logic routines using blocks that you simply drag and drop onto the coding canvas, where they seamlessly snap together to form programs. These blocks act as the fundamental building blocks of your app, representing either data or operations on data. Whether you’re performing arithmetic operations, text analysis, or time-related tasks, AppInventor offers an array of blocks to fulfill your programming needs, from creating a simple pedometer to crafting an AI-powered chatbot.
Demo: Mole Mash
Take, for instance, the classic game of Mole Mash, where the mole rapidly changes its location on the screen, and players score points by tapping it. AppInventor makes it a breeze to create this game with just a few blocks. By adding a timer element and configuring it to trigger every second, you can implement a callback function called ‘moveMole’ using blocks. In this custom function, you calculate the mole’s coordinates using a random number generator and ensure it stays within the screen boundaries. With minimal code, you breathe life into your game!
A Vibrant User Community
AppInventor encourages collaboration and learning within a community of fellow coders. Its active user community provides invaluable support, fostering an environment where problem-solving becomes a shared journey. This mirrors the collaborative nature of the tech industry, where community assistance plays a crucial role in the coding process.
Skillsets You’ll Develop
Coding with AppInventor hones a wide range of valuable skills, including:
Designing captivating user interfaces with stunning graphics.
Harnessing smartphone sensors to enhance app functionality.
Creating animated games featuring sprites and canvases.
Orchestrating events using timers.
Crafting program flow control with loops and conditional statements.
Modeling data using contemporary structures like lists and dictionaries.
Storing data locally or in cloud databases.
Mastering event-driven programming.
Grasping principles of object-oriented programming.
Exploring the fascinating world of artificial intelligence.
And many more!
So, what would you like to build with the limitless potential of MIT AppInventor? Let your creativity soar, and turn your app ideas into reality today!
Unlock Your App Potential with Our MIT AppInventor Course!
🚀 Ready to transform your app dreams into reality? 🌟 Join us and discover the power of MIT AppInventor. 🧠 Learn to create engaging apps, from games to AI-driven solutions. 👩💻 No coding experience needed – just bring your ideas!
📆 Enroll now and embark on your app-building journey!
We have coded CRUD operations and provided data models in the Data layer to support I/O. We have modeled data to align with domain business logic and housed these models in the Domain layer for other layers to use. We have provided mapping functionality between the two sets of data models. So are we ready to access data?
There is some more work (!) to do in order to build a professional app that is responsive and doesn’t throw up surprises when contingencies occur.
We want to be mindful of the following:
Remote I/O operations take an undefined amount of time and are unpredictable due to contingencies such as poor internet connectivity, insufficient access privilege or remote service outages. We need to account for these contingencies in our app.
We want to offer a responsive User Experience (UX) that doesn’t keep the user waiting or require unnecessary clicks for user action to be completed.
The repository pattern is where this management overhead resides. At the very least, we implement the following features as we open the door to data in our app:
Exception Handling: We wrap I/O operations in a try-catch block and print diagnostic error messages. We can further enhance this to look for specific errors and fine-tune how we handle contingencies.
Event Emissions: We emit results, whether success or failure, as events to take advantage of Kotlin flows in our app. Streaming events in this way offers a superbly elegant way to serve content in a dynamically updating UI that is listening for events.
The effect is magical!
Data Contract
Define the contract in Domain layer
Let’s look at sample code. Following the pattern of programming to an interface, we define the repository’s interface in the package “repository” in the Domain layer:
interface QuizRepository {
    suspend fun getModuleNames(
        fetchFromRemote: Boolean
    ): Resource<List<ChapterInfo>>

    suspend fun getModuleContents(
        name: String
    ): Flow<Resource<List<QuizQuestion>>>
}
The code defines an interface named QuizRepository that outlines the functions required for interacting with data related to the quiz modules. The functions are designed for asynchronous I/O with Kotlin Coroutines. They can take advantage of Kotlin flows to sequentially emit events in place of a single return value. Let’s unpack the functions defined in this interface:
suspend fun getModuleNames(fetchFromRemote: Boolean): Resource<List<ChapterInfo>>: This function fetches a list of module names (representing different sections or chapters) that are part of the quiz app. It is marked with suspend to indicate that it should be called from a coroutine or another suspending function, as it is designed to perform network I/O operations asynchronously. It takes a Boolean parameter fetchFromRemote, which suggests whether the data should be fetched from a remote data source (e.g., a server) or if it can be retrieved from a local cache or other sources. The exact behavior can be fine-tuned in the implementation. It returns a Resource object wrapping a List of ChapterInfo. The Resource class is often used to represent the result of asynchronous operations and can hold data, loading states, or error messages.
suspend fun getModuleContents(name: String): Flow<Resource<List<QuizQuestion>>>: This function is responsible for fetching the contents of a specific quiz module based on its name. It is also marked with suspend, indicating it’s a suspending function suitable for asynchronous operations. It takes a name parameter representing the identifier of the quiz module to retrieve. It returns a Flow of Resource wrapping around a List of QuizQuestion objects. A Flow is a cold asynchronous data stream that emits values over time. The Resource class is designed to wrap around a generic type so as to encapsulate any result type from a successful I/O operation.
In summary, the QuizRepository interface defines functions to fetch module names and module contents. These functions return data wrapped in a Resource object, indicating the status of the operation (e.g., success, loading, or error). The implementation of such an interface provides the actual logic for fetching data, including network requests, database queries, or other data retrieval methods, using modulating flags like fetchFromRemote to tune the behavior. This structure allows for flexibility in handling data sources and provides a clean separation of concerns in the app’s architecture.
Data Access
Flesh out the contract in Data layer
Being in the Domain layer, the interface only knows the data models in its home layer. The implementation, however, requires use of assets we have built out in the Data layer. Hence, we put the implementation in the package “repository” in the Data layer.
Do you see how the separation of concerns works? The repository opens a door to the Data layer and defines the contract in the interface. The contract, in the Domain layer, is written in the language of the domain. The contract’s implementation resides in the Data layer. We can choose to change the implementation should we make changes in the Data layer, for example, cache remote data locally. These changes need not affect the contract, wherein this detail is abstracted away.
Now let’s look at a concrete function where the I/O operation is implemented.
override suspend fun getModuleContents(name: String): Flow<Resource<List<QuizQuestion>>> {
    return flow {
        emit(Resource.Loading(true))
        try {
            val knowledgeBank = fireStoreCRUD.fetchModuleContents(name)
            emit(Resource.Success(data = knowledgeBank))
        } catch (e: Exception) {
            emit(Resource.Failure(message = e.message ?: "No contents found for $name in Firestore!"))
        } finally {
            emit(Resource.Loading(false))
        }
    }
}
Here, we have an implementation of the getModuleContents function within the repository, which fetches the contents of a specific quiz module. This function returns a Flow of Resource<List<QuizQuestion>>. Let’s break down how it works:
override suspend fun getModuleContents(name: String): Flow<Resource<List<QuizQuestion>>> { ... }: This function is marked with override as it provides the concrete implementation of the abstract suspend function of the same name in the interface.
return flow { ... }: The function returns a Kotlin Flow, which is a cold asynchronous data stream that emits values over time. The emissions use the Resource class to represent the state of the I/O operation – whether success, failure or work-in-progress.
emit(Resource.Loading(true)): The initial emission within the flow indicates that the operation is in a loading state, with Resource.Loading(true) signaling that loading has started.
try { ... } catch (e: Exception) { ... } finally { ... }: The core logic of the function is enclosed within a try-catch-finally block to handle different scenarios during the data retrieval process.
val knowledgeBank = fireStoreCRUD.fetchModuleContents(name): Inside the try block, it attempts to fetch the contents of the specified quiz module by calling function fetchModuleContents from a fireStoreCRUD instance passed to the repository via dependency injection. The function is responsible for making a request to the Firestore database.
emit(Resource.Success(data = knowledgeBank)): If the data retrieval is successful (i.e., no exceptions are thrown), it emits an event of type Resource.Success with the fetched List<QuizQuestion> in variable knowledgeBank as the data payload. This indicates that the operation was successful and makes the retrieved data available to the consumers of this Flow.
catch (e: Exception) { ... }: If an exception is thrown during the data retrieval process, it catches the exception. It then emits an event of type Resource.Failure with an error message derived from the exception, or a default message if no message is available. This indicates that there was a failure in retrieving the data.
finally { ... }: The finally block is used to ensure that the loading state is correctly updated, regardless of whether the data retrieval was successful or encountered an error. It emits an event of type Resource.Loading(false) to signal that the loading has completed.
In summary, this function returns a Flow that emits a sequence of Resource objects to represent the status of the data retrieval operation. It starts by emitting a loading state, then attempts to fetch the data. If successful, it emits a success state with the retrieved data; if an error occurs, it emits a failure state with an error message. Finally, it emits a loading state indicating that the operation has completed, whether successfully or with an error. This design allows the UI to respond to different states (loading, success, failure) and appropriately display the data or error messages to the user.
Resource
Create class Resource for state emissions
A traditional function returns a single result at the end of the operation. By contrast, a Kotlin Flow allows us to emit events asynchronously as many times as needed in an operation. A sealed class is commonly used for handling different states or outcomes of such an asynchronous operation, when making network requests or performing database queries.
sealed class Resource<T>(val data: T? = null, val message: String? = null) {
    class Success<T>(data: T?) : Resource<T>(data)
    class Failure<T>(message: String, data: T? = null) : Resource<T>(data, message)
    class Loading<T>(val isLoading: Boolean = true) : Resource<T>(null)
}
Here, we have defined a sealed class named Resource<T> that has three subclasses:
Success<T>: This subclass represents a successful outcome of an operation. It contains a nullable data attribute to hold the result of the operation, which is of a generic type T. It inherits from the Resource<T> class, passing the data to its parent class constructor.
Failure<T>: This subclass represents a failed outcome of an operation. It contains a message attribute, which can hold an error message or description, and an optional data attribute that can contain additional data related to the failure. It also inherits from the Resource<T> class, passing both data and message to its parent class constructor.
Loading<T>: This subclass represents a loading or in-progress state of an operation. It includes an isLoading attribute, which is a boolean indicating whether the operation is still loading. It does not carry any data or message, so it passes null to the parent class constructor.
By using this sealed class, we can create instances of Resource to represent different states of our app’s operations, making it easier to handle and communicate the results, errors, and loading states in a consistent and safe manner throughout our app. We will use this class in the ViewModel to provide real-time updates to our UI based on the state of asynchronous operations.
Conclusion
Takeaway and Next Steps
Recap:
Create a repository interface in the package “repository” in Domain layer, which will be used by the Presentation layer.
Implement the repository interface in the package “repository” in the Data layer.
Implement sealed class Resource in package “util”.
Now we are ready to use the repository to access data in the Presentation layer. However, before we can use the repository in the View Model, there is one more setup step we need to take care of: setting up the repository for injection into the View Model using Dagger-Hilt!
Business Logic
Model data for ingestion into the app
We define data models in the Data layer for storage and separate data models in the Domain layer for implementing the core business logic of our app. Storage can take various forms, such as when we introduce caching, requiring its own set of data models in the Data layer for local database operations.
But why the need for distinct data models? The choice of storage technology significantly influences our data design. In the case of relational databases, like Room DB for caching, data must conform to an atomic entity structure. Consequently, we place the corresponding data models in the package “local” within the Data layer and employ them for CRUD operations.
In contrast, NoSQL databases, like Firebase Firestore, offer more flexibility. Firestore’s data organization revolves around collections of documents, where each document contains named slots for numeric or textual data. This organizing principle is reflected in the design of our data models, which we place in the package “remote” within the Data layer.
The Domain layer smooths over these differences by presenting data models in a manner closer to how we typically conceptualize domain entities, ensuring they fit seamlessly into our app’s business logic. We introduce a package “mapper” in the Data layer to house the code for mapping between these different sets of models. This code lives in the Data layer, adhering to the principle that the Domain layer may not know about other layers but the other layers know about the Domain layer.
Mapper
Write extension functions for mapping
We implement a mapper as an extension function of the source class that returns an object of the target class. This one-to-one mapping is fairly typical. The simplicity of the business logic in our use case means we do not need to deal with nested data structures, complex joins, or aggregates.
Here is a sample mapper function.
fun ChapterInfoFire.toChapterInfo(): ChapterInfo {
    return ChapterInfo(
        name = name,
        title = title,
        description = description
    )
}
The ChapterInfoFire.toChapterInfo() function is an extension function that maps an instance of the ChapterInfoFire data class into an instance of the ChapterInfo data class. In Kotlin, an extension function allows adding new functionality to an existing class without modifying its source code.
Here’s a breakdown of how the function works:
ChapterInfoFire.toChapterInfo(): This is the function signature, indicating that it’s an extension function for the ChapterInfoFire class, and it will return a ChapterInfo object.
return ChapterInfo(...): The function returns a new ChapterInfo object, and within the parentheses, it initializes the properties of the ChapterInfo object.
name = name, title = title, description = description: Here, it’s copying the values of the properties from the ChapterInfoFire object to the corresponding properties in the ChapterInfo object. The name, title, and description properties in the ChapterInfo object are set equal to the values of the same-named properties in the ChapterInfoFire object.
In summary, this extension function simplifies the process of converting a ChapterInfoFire object into a ChapterInfo object by copying the values from one to the other.
This example shows how we map data from the Data layer to the Domain layer. It allows seamless data flows while ensuring separation of concerns in MVVM architecture.
Conclusion
Takeaways and Next Steps
Recap:
Create data models in the “remote” package in Data layer for Firestore CRUD operations.
Create data models in the package “model” in Domain layer to use in the app’s business logic.
Implement mapper functions in the “mapper” package to map data models between Data and Domain layers.
We have now set the stage for ingestion of data into the app. But we have yet to address all aspects of data management for a professional app. For example, our query may fail due to a poor network connection. We need to address this possibility and ensure the app handles off-the-happy-path scenarios gracefully.
Another important consideration is, we want to use Kotlin’s data structures to manage data flows to deliver a responsive user experience that doesn’t keep the user waiting and minimizes or eliminates unnecessary clicks. In other words, we need a repository pattern. That’s coming up next.
A key feature of the Domain layer is that it must not know about the Data and Presentation layers, but the other layers may know about it. The Domain layer serves as a crucial bridge between the Data and Presentation layers. Why do we need this bridge?
Because the design considerations that drive how we shape data differ between when we store data and when we work with data in our app. That means we need to maintain two sets of data models and the code that maps between them. So far, we have seen the storage side of the story. Now let us look at the domain side.
Data Models
Define data models for other layers
On our screen, we want to present a ToC, which is a list of chapters. Here is what the model for a single chapter looks like:
data class ChapterInfo(
    val name: String,
    val title: String,
    val description: String
)
Compare with the model ChapterInfoFire in the Data layer’s package “remote”. These models are very alike.
The domain model for a quiz is as follows:
data class Quiz(
    val questionNumber: Int,
    val question: String,
    val answer: String,
    val image: String?,
    val youTubeVideoId: String,
    val timestamp: Int,
    val chapterKey: String,
    val isActive: Boolean
)
Here, we employ a questionNumber attribute to facilitate both the ranking and unique identification of questions within a chapter or module. The quiz question may have an explanatory image to accompany the answer. The quiz includes a YouTube video identified by youTubeVideoId with explanatory content. A timestamp marks the start time of relevant content on the video track. The chapterKey attribute is employed as a reference to the parent chapter or module, enhancing the ease of navigating associated information.
In the realm of relational databases, the navigation of data relationships across tables is guided by foreign key constraints, ensuring a structured approach. However, Firestore’s NoSQL design opts for a more flexible approach by discarding these constraints to reduce upfront storage design complexity. Nevertheless, this flexibility comes at the cost of requiring additional logic in our application code to navigate these relationships. For intricate relationship scenarios, it may be beneficial to explore alternatives, such as leveraging a “built-to-suit” backend solution with technologies like PostgreSQL and a Python FastAPI wrapper to streamline complex data interactions.
Both the data models we have shown (and any others we need in the future) are housed in package “model” in Domain layer.
Conclusion
Takeaways and Next Steps
We have defined the data models in the Domain layer. Our “Android Ally” quiz app does not have complex processing logic at this stage and the models sufficiently structure data for presentation. Where should the logic of mapping between sets of models reside? For this, we have package “mapper” in the Data layer. Let’s look at that next.