Enterprise AI
Artificial intelligence (AI) has continued to change the way the world views what is technologically possible. Moving from theoretical to implementable, the emergence of technologies like ChatGPT allowed users of all backgrounds to leverage the power of AI. Now, companies across the globe are taking a deeper dive into their own AI and machine learning (ML) capabilities; they’re measuring the modes of success needed to become truly AI-driven, moving beyond baseline business intelligence goals and expanding to more innovative uses in areas such as security, automation, and performance.

In DZone’s Enterprise AI Trend Report, we take a pulse on the industry nearly a year after the ChatGPT phenomenon and evaluate where individuals and their organizations stand today. Through our original research that forms the “Key Research Findings” and articles written by technical experts in the DZone Community, readers will find insights on topics like ethical AI, MLOps, generative AI, large language models, and much more.
In the landscape of software development, efficiently processing large datasets has become paramount, especially with the advent of multicore processors. The Java Stream interface provided a leap forward by enabling sequential and parallel operations on collections. However, fully exploiting modern processors' capabilities while retaining the Stream API’s simplicity posed a challenge. Responding to this, I created an open-source library aimed at experimenting with a new method of parallelizing stream operations. This library diverges from traditional batching methods by processing each stream element in its own virtual thread, offering a more refined level of parallelism. In this article, I will talk about the library and its design, in more detail than you need to simply use it.

The library is available on GitHub and also as a dependency in Maven Central:

<dependency>
    <groupId>com.github.verhas</groupId>
    <artifactId>vtstream</artifactId>
    <version>1.0.1</version>
</dependency>

Check the actual version number on the Maven Central site or on GitHub. This article is based on version 1.0.1 of the library.

Parallel Computing

Parallel computing is not a new thing; it has been around for decades. The first computers executed tasks in batches, hence serially, but soon the idea of time-sharing came into the picture. The first time-sharing computer system was installed in 1961 at the Massachusetts Institute of Technology (MIT). This system, known as the Compatible Time-Sharing System (CTSS), allowed multiple users to log into a mainframe computer simultaneously, working in what appeared to be a private session. CTSS was a groundbreaking development in computer science, laying the foundation for modern operating systems and computing environments that support multitasking and multi-user operations. It was not a parallel computing system per se: CTSS ran on a single mainframe computer, the IBM 7094, which had one CPU, so the code was executed serially.

Today we have multicore processors and multiple processors in a single computer; I am editing this article on a machine with 10 processor cores. To execute tasks concurrently, there are two approaches, plus a third that mixes them:

- Define the algorithm in a concurrent way; for example, reactive programming.
- Define the algorithm the good old sequential way and let some program decide on the concurrency.
- Mix the two.

When we program a reactive algorithm or define streams as in the Java 8 Stream API, we help the application execute the tasks concurrently. We define small parts and their interdependence so that the environment can decide which parts can be executed concurrently. The actual execution is then done by the framework, using virtual threads, threads, or perhaps processes. The difference between these lies in the scheduler: who decides which processor executes which task at the next moment. In the case of threads or processes, the scheduler is the operating system. The difference between thread and process execution is that threads belonging to the same process share the same memory space, while processes have their own. Similarly, virtual threads share the operating-system carrier threads they are mounted on. Transitioning from processes to threads to virtual threads, we encounter a reduction in shared resources and, consequently, overhead. This makes virtual threads significantly less costly than traditional threads. While a machine might support thousands of threads and processes, it can accommodate millions of virtual threads.
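To make the cost difference concrete, here is a minimal, self-contained sketch that starts a large number of virtual threads using only the standard Java 21 thread API. It is illustrative and not part of the library:

import java.time.Duration;
import java.util.ArrayList;
import java.util.List;

public class VirtualThreadDemo {
    public static void main(String[] args) throws InterruptedException {
        List<Thread> threads = new ArrayList<>();
        // 100,000 platform threads would typically exhaust operating-system
        // limits; the same number of virtual threads is routine.
        for (int i = 0; i < 100_000; i++) {
            threads.add(Thread.startVirtualThread(() -> {
                try {
                    Thread.sleep(Duration.ofMillis(100)); // simulated blocking work
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }));
        }
        for (Thread t : threads) {
            t.join(); // wait for every virtual thread to finish
        }
        System.out.println("All virtual threads finished");
    }
}

While a virtual thread blocks in sleep, its carrier thread is freed to run other virtual threads, which is what makes this scale.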
In defining a task with streams, you are essentially outlining a series of operations to be performed on multiple elements. The decision to execute these operations concurrently rests with the framework, which may or may not choose to do so. However, Stream in Java serves as a high-level interface, offering us the flexibility to implement a version that facilitates concurrent execution of tasks.

Implementing Streams in Threads

The library contains two primary classes located in the main directory:

- ThreadedStream
- Command

ThreadedStream is the class responsible for implementing the Stream interface:

public class ThreadedStream<T> implements Stream<T> {

The Command class encompasses nested classes that implement the functionality of the stream operations:

public static class Filter<T> extends Command<T, T> {
public static class AnyMatch<T> extends Command<T, T> {
public static class FindFirst<T> extends Command<T, T> {
public static class FindAny<T> extends Command<T, T> {
public static class NoOp<T> extends Command<T, T> {
public static class Distinct<T> extends Command<T, T> {
public static class Skip<T> extends Command<T, T> {
public static class Peek<T> extends Command<T, T> {
public static class Map<T, R> extends Command<T, R> {

All the mentioned operators are intermediaries. The terminal operators are implemented within the ThreadedStream class, which converts the threaded stream into a regular stream before invoking the terminal operator on that stream. An example of this approach is the implementation of the collect method:

@Override
public <R> R collect(Supplier<R> supplier,
                     BiConsumer<R, ? super T> accumulator,
                     BiConsumer<R, R> combiner) {
    return toStream().collect(supplier, accumulator, combiner);
}

The source of the elements is also a stream, which means that the threading functionality is layered atop the existing stream implementation. This setup allows for the utilization of streams both as data sources and as destinations for processed data. Threading occurs in the interim, facilitating the parallel execution of the intermediary commands. Therefore, the core of the implementation, and its most intriguing aspect, lies in the construction of the structure and its subsequent execution. We will first examine the structure of the stream data and then explore how the class executes operations utilizing virtual threads.

Stream Data Structure

The ThreadedStream class maintains its data through the following member variables:

private final Command<Object, T> command;
private final ThreadedStream<?> downstream;
private final Stream<?> source;
private long limit = -1;
private boolean chained = false;

- command represents the Command object to be executed on the data. It may be a no-operation (NoOp) command, or null if there is no specific command to execute.
- downstream points to the preceding ThreadedStream in the processing chain. A ThreadedStream retrieves data either from the immediate downstream stream, if available, or directly from the source if it is the first in the chain.
- source is the initial data stream. It remains defined even when a downstream is specified, in which case the source of both streams is identical.
- limit specifies the maximum number of elements this stream is configured to process. Implementing a limit requires a workaround, as stream element processing starts immediately rather than being "pulled" by the terminal operation. Consequently, infinite streams cannot feed a ThreadedStream.
- chained is a boolean flag indicating whether the stream is part of a processing chain. When true, it signifies that a subsequent stream depends on this one’s output, preventing execution in cases of processing forks. This mechanism mirrors the approach found in the JVM’s standard stream implementations.
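The code excerpts later in the article rely on Command.Result<T> and its result(), exception(), and isDeleted() accessors. Purely as an illustration of the shape such a per-element command API can take (this is not the library’s actual code; see GitHub for the real definitions), a stand-alone sketch might look like this:

import java.util.function.Function;
import java.util.function.Predicate;

// Illustrative sketch only; the real Command/Result classes live in vtstream
// and almost certainly differ in detail.
abstract class MiniCommand<T, R> {

    // A result carries a value, an exception, or a "deleted" marker used when
    // a filter drops the element (mirroring the isDeleted() checks shown later).
    record Result<R>(R result, Throwable exception, boolean isDeleted) {}

    abstract Result<R> execute(T input);

    static final class Map<T, R> extends MiniCommand<T, R> {
        private final Function<? super T, ? extends R> mapper;

        Map(Function<? super T, ? extends R> mapper) {
            this.mapper = mapper;
        }

        @Override
        Result<R> execute(T input) {
            try {
                return new Result<>(mapper.apply(input), null, false);
            } catch (Throwable t) {
                return new Result<>(null, t, false); // exception captured, not thrown
            }
        }
    }

    static final class Filter<T> extends MiniCommand<T, T> {
        private final Predicate<? super T> predicate;

        Filter(Predicate<? super T> predicate) {
            this.predicate = predicate;
        }

        @Override
        Result<T> execute(T input) {
            return predicate.test(input)
                    ? new Result<>(input, null, false)
                    : new Result<>(null, null, true); // marked deleted: filtered out
        }
    }
}

public class MiniCommandDemo {
    public static void main(String[] args) {
        var map = new MiniCommand.Map<Integer, Integer>(x -> x * 2);
        System.out.println(map.execute(21).result());        // 42
        var even = new MiniCommand.Filter<Integer>(x -> x % 2 == 0);
        System.out.println(even.execute(3).isDeleted());     // true
    }
}

The key design point the sketch captures is that a command never throws: failures and filtered-out elements are encoded in the result so that the worker thread always terminates normally.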
Stream Build

The stream data structure is constructed dynamically as intermediary operations are chained together. The process starts by creating an initial element with the static method threaded on the ThreadedStream class. A line from the unit tests illustrates this initiation:

final var k = ThreadedStream.threaded(Stream.of(1, 2, 3));

This line creates a ThreadedStream instance named k, initialized with a source stream consisting of the elements 1, 2, and 3. The threaded method serves as the entry point for transforming a regular stream into a ThreadedStream, setting the stage for further operations that can leverage virtual threads for concurrent execution.

When an intermediary operation is appended, a new ThreadedStream instance is created. This new instance designates the preceding ThreadedStream as its downstream, and its source stream remains identical to the source stream of its predecessor. This design ensures a seamless flow of data through the chain of operations. For example, when we call

final var t = k.map(x -> x * 2);

the map method is invoked, which is:

public <R> ThreadedStream<R> map(Function<? super T, ? extends R> mapper) {
    return new ThreadedStream<>(new Command.Map<>(mapper), this);
}

It generates a new ThreadedStream object wherein the preceding ThreadedStream acts as the downstream, and the command field is populated with a new instance of the Command.Map class, configured with the specified mapper function. This process effectively constructs a linked list of ThreadedStream objects, which guides the flow of operation execution once a terminal operation triggers the execution phase.

It is crucial to understand that the ThreadedStream class refrains from performing any operations on the data until a terminal operation is called. Once execution commences, it proceeds concurrently. To facilitate independent execution of these operations, ThreadedStream instances are designed to be immutable. They are instantiated during the setup phase and undergo a single mutation when they are linked together. During execution, these instances serve as a read-only data structure, which ensures thread safety and consistency throughout concurrent processing, allowing for efficient and reliable stream handling.
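Putting the build steps together, here is a minimal end-to-end usage sketch assembled from the API calls quoted above. The package name in the import is an assumption, so check the project’s documentation:

import com.github.verhas.vtstream.ThreadedStream; // assumed package; see the project on GitHub

import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class VtStreamExample {
    public static void main(String[] args) {
        // Build the chain lazily; nothing runs until the terminal operation.
        List<Integer> doubled = ThreadedStream.threaded(Stream.of(1, 2, 3))
                .map(x -> x * 2)                  // intermediary operation
                .collect(Collectors.toList());    // terminal operation starts execution
        System.out.println(doubled);              // [2, 4, 6] for an ordered source
    }
}

At the collect call, each element is processed in its own virtual thread, as described next.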
Stream Execution

Stream execution commences when a terminal operation is invoked. Terminal operations are executed by first transforming the threaded stream back into a conventional stream, upon which the terminal operation is then performed. This bridges the gap between the concurrent execution facilitated by virtual threads and the conventional stream processing model of Java: by converting the ThreadedStream into a standard Stream, the library leverages the rich ecosystem of terminal operations already available, ensuring compatibility with minimal overhead. The collect method is a prime example:

@Override
public <R> R collect(Supplier<R> supplier,
                     BiConsumer<R, ? super T> accumulator,
                     BiConsumer<R, R> combiner) {
    return toStream().collect(supplier, accumulator, combiner);
}

The toStream() method represents the core functionality of the library. It marks the commencement of stream execution, initiating a new virtual thread for each element of the source stream. The method differentiates between ordered and unordered execution through two distinct implementations:

- toUnorderedStream()
- toOrderedStream()

The choice between the two is determined by the isParallel() status of the source stream. It is worth noting that executing an ordered stream in parallel can still be advantageous: although the results may be produced out of order, parallel processing accelerates the operation, and care is taken to collect the results sequentially. Unordered processing can yield higher efficiency, because elements can be passed to the resulting stream as soon as they become available, without waiting for the preceding elements.

The implementation of toStream() is designed to minimize unnecessary buffering of elements. Elements are forwarded to the resulting stream immediately upon readiness in the case of unordered streams, and, for ordered streams, as soon as they are ready and the preceding element has been forwarded. The following sections delve into the specifics of these two execution methodologies.

Unordered Stream Execution

Unordered execution promptly forwards results as they become ready. It employs a concurrent list for result storage, so threads can deposit results while the target stream simultaneously retrieves them, preventing excessive list growth. Iterating over the source stream creates a new virtual thread for each element. When a limit is imposed, it is applied directly to the source stream, diverging from traditional stream implementations where limit acts as a genuine intermediary operation. The implementation of unordered stream execution is as follows:

private Stream<T> toUnorderedStream() {
    final var result = Collections.synchronizedList(new LinkedList<Command.Result<T>>());
    final AtomicInteger n = new AtomicInteger(0);
    final Stream<?> limitedSource = limit >= 0 ? source.limit(limit) : source;
    limitedSource.forEach(
            t -> {
                Thread.startVirtualThread(() -> result.add(calculate(t)));
                n.incrementAndGet();
            });
    return IntStream.range(0, n.get())
            .mapToObj(i -> {
                while (result.isEmpty()) {
                    Thread.yield();
                }
                return result.removeFirst();
            })
            .filter(f -> !f.isDeleted())
            .peek(r -> {
                if (r.exception() != null) {
                    throw new ThreadExecutionException(r.exception());
                }
            })
            .map(Command.Result::result);
}

The counter n tallies the number of threads started. The resulting stream is constructed using this counter by mapping the numbers 0 to n-1 to the elements of the concurrent list as they become ready. If the list lacks elements at any point, the process waits for the next element to arrive. The waiting mechanism is a loop around Thread.yield(), which relinquishes the processor between checks to limit unnecessary CPU consumption while waiting for the next result.
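The yield-based busy wait works, but the same producer-consumer hand-off can also be expressed with a blocking queue, which parks the consumer instead of polling. The following stand-alone sketch shows that alternative design for illustration; it is not the library’s code:

import java.util.List;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.stream.IntStream;
import java.util.stream.Stream;

public class QueueHandOff {
    public static void main(String[] args) {
        var queue = new LinkedBlockingQueue<Integer>();
        List<Integer> input = List.of(1, 2, 3, 4, 5);
        // Producer side: one virtual thread per element; results land in the queue.
        input.forEach(i -> Thread.startVirtualThread(() -> queue.add(i * i)));
        // Consumer side: take() blocks (parks the thread) until an element
        // arrives, so no CPU is spent spinning as in the yield loop above.
        Stream<Integer> results = IntStream.range(0, input.size())
                .mapToObj(ignored -> {
                    try {
                        return queue.take();
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                        throw new IllegalStateException(e);
                    }
                });
        System.out.println(results.toList()); // squares, in completion order
    }
}

Parking is particularly cheap for virtual threads, which is one reason blocking hand-offs pair well with this execution model.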
Ordered Stream Execution

Ordered stream execution takes a more nuanced approach than its unordered counterpart. It introduces a local class named Task, designed specifically to await the completion of a particular thread. As in the unordered case, a concurrent list is used, but with a key distinction: the elements of this list are the tasks themselves rather than the results, and the list is populated by the code that creates the threads, not by the threads themselves. Because the list is fully populated up front, there is no need for a separate counter to track thread initiation. The process then waits on each thread sequentially, in list order, relaying each thread’s output to the target stream in sequence. This preserves the ordered integrity of the stream’s elements, despite the concurrent nature of their processing, by aligning the execution flow with the sequence of the original stream.

private Stream<T> toOrderedStream() {
    class Task {
        Thread workerThread;
        volatile Command.Result<T> result;

        /**
         * Wait for the thread calculating the result of the task to be finished.
         * This method is blocking.
         *
         * @param task the task to wait for
         */
        static void waitForResult(Task task) {
            try {
                task.workerThread.join();
            } catch (InterruptedException e) {
                task.result = deleted();
            }
        }
    }

    final var tasks = Collections.synchronizedList(new LinkedList<Task>());
    final Stream<?> limitedSource = limit >= 0 ? source.limit(limit) : source;
    limitedSource.forEach(
            sourceItem -> {
                Task task = new Task();
                tasks.add(task);
                task.workerThread = Thread.startVirtualThread(() -> task.result = calculate(sourceItem));
            }
    );
    return tasks.stream()
            .peek(Task::waitForResult)
            .map(f -> f.result)
            .peek(r -> {
                if (r.exception() != null) {
                    throw new ThreadExecutionException(r.exception());
                }
            })
            .filter(r -> !r.isDeleted())
            .map(Command.Result::result);
}

Summary and Takeaway

Having explored an implementation that facilitates the parallel execution of stream operations, it is worth noting that this library is open source, so you can either use it as is or reference its design and implementation to craft your own version. The detailed exposition provided here aims to shed light on both the conceptual underpinnings and the practical aspects of the library’s construction. It is important to acknowledge, however, that the library has not undergone extensive testing. It received a review from Istvan Kovacs, who has considerable expertise in concurrent programming, but that review is no absolute assurance of the library’s reliability or absence of bugs. Should you decide to integrate this library into your projects, proceed with caution and conduct thorough testing to ensure it meets your requirements and standards. The library is provided "as is," with the understanding that users adopt it at their own risk, underpinning the importance of due diligence in its deployment.
TL;DR: Scrum Master Interview Questions on Creating Value With Scrum

If you are looking to fill a Scrum Master (or agile coach) position in your organization, you may find the following 12th set of Scrum Master interview questions useful for identifying the right candidate. They are derived from my eighteen years of practical experience with XP and Scrum, serving as both Product Owner and Scrum Master, as well as from interviewing dozens of Scrum Master candidates on behalf of my clients. So far, this Scrum Master interview guide has been downloaded more than 27,000 times.

Scrum Master Interview Questions: How We Organized Questions and Answers

Scrum has proven time and again to be the most popular framework for software development. Given that software is eating the world, a seasoned Scrum Master remains in high demand even now, in the frosty economic climate of Spring 2024. That demand draws new professionals into the market from other project management branches, some probably believing that reading one or two Scrum books will be sufficient, which makes any Scrum Master interview a challenging task.

The Scrum Master Interview Questions ebook provides both the questions and guidance on the range of suitable answers. These should allow an interviewer to dive deep into a candidate’s understanding of Scrum and her agile mindset. However, please note:

- The answers reflect the personal experience of the authors and may not be valid for every organization: what works for organization A may not work in organization B.
- There are no suitable multiple-choice questions to identify a candidate’s agile mindset, given the complexity of applying “Agile” to any organization.
- The authors share a holistic view of agile practices: agility covers the whole arc from product vision (our grand idea of how to improve mankind’s fate) to product discovery (what to build) to product delivery (how to build it).

Creating Value as a Scrum Master

The following questions and answers are designed to draw out a nuanced understanding of a candidate’s experience and skills in applying agile product development principles to improve customer value, improve the economics of delivery, and enhance predictability in various organizational contexts, addressing the current economic climate:

Question 74: Resistant Industries

How have you tailored Scrum practices to elevate customer value, particularly in industries resistant to Agile practices?

Background: This question probes the candidate’s ability to adapt Scrum principles to sectors where Agile is not the norm, emphasizing customer-centric product development. It seeks insights into the candidate’s innovative application of Scrum to foster customer engagement and satisfaction, even in challenging environments. It is also an opportunity for the candidate to build confidence in the interview process and rapport with the interviewers.

Acceptable Answer: An excellent response would detail a scenario where the candidate navigated resistance by demonstrating Agile’s benefits through small-scale pilot projects or workshops. They would probably even describe specific adjustments to Scrum events or artifacts to align with industry-specific constraints, culminating in enhanced customer feedback loops and ultimately leading to product features that directly addressed customer pain points.
Question 75: Reducing Product Costs

Please describe a scenario in which you significantly reduced production costs through strategic Scrum application without compromising the product’s quality.

Background: This delves into the candidate’s proficiency in supporting the optimization of a team’s capacity allocation and streamlining workflows within the Scrum framework to cut costs. It is about balancing high-quality standards with cost-effectiveness through Agile practices.

Acceptable Answer: Look for a narrative where the candidate identifies wasteful practices or bottlenecks in the development process and implements targeted Scrum practices to address them. Examples include refining the Product Backlog to focus on high-impact features, improving cross-functional collaboration to reduce dependencies, or leveraging automated testing to shorten lead time while preserving quality standards. The answer should highlight the candidate’s analytical problem-solving approach and ability to help the team adopt a cost-conscious, entrepreneurial stance toward solving customer problems without sacrificing quality.

Question 76: Improving Predictability in a Volatile Market

Please share an experience where you used Scrum to improve the predictability of product delivery in a highly volatile market.

Background: This question explores the candidate’s capability to use Scrum to enhance delivery predictability amid market fluctuations. It is about leveraging Agile’s flexibility to adapt to changing priorities while maintaining a steady pace of delivery.

Acceptable Answer: The candidate should recount an instance where they utilized Scrum artifacts and events to better forecast delivery timelines in a shifting landscape. This example might involve adjusting Sprint lengths, prioritizing Product Backlog items more dynamically, or closer stakeholder engagement to reassess priorities during Sprint Reviews or other alignment-creating opportunities, for example, User Story Mapping sessions. The story should underscore their strategic thinking in balancing flexibility with predictability and their communication skills in setting realistic expectations with stakeholders.

Question 77: Successfully Promoting Scrum Despite Skepticism

How have you promoted the value of Scrum in organizations where leadership and middle management met Agile practices with skepticism?

Background: This question examines the candidate’s ability to champion Scrum in environments resistant to change. Such an environment requires a deep understanding of Agile principles as well as strong advocacy and education skills.

Acceptable Answer: Successful candidates will describe a multifaceted strategy that includes educating leadership on Agile benefits, organizing interactive workshops to demystify Scrum practices, and securing quick wins to demonstrate value. They might also discuss establishing a community of practice to sustain Agile learning and sharing success stories to build momentum. The answer should reflect their perseverance, persuasive communication, and role as a change agent. (Learn more about successful stakeholder communication tactics during transformations here.)

Question 78: Effective Change

Please describe your approach to conducting effective Sprint Retrospectives that drive continuous improvement.

Background: The question probes the candidate’s techniques for facilitating Retrospectives that genuinely contribute to team growth and product enhancement.
It seeks to understand how they ensure these events are productive, inclusive, and actionable.

Acceptable Answer: A comprehensive response would outline a structured approach to Retrospectives, including preparation, facilitation, follow-up practices, and valuable enhancements to the framework, for example, embracing the idea of a directly responsible individual to drive change the team considers beneficial. The candidate might mention using a variety of formats to keep the sessions engaging, techniques to ensure all team members contribute, and strategies for prioritizing action items. They should emphasize their method for tracking improvements over time to ensure accountability and demonstrate the Retrospective’s impact on the team’s performance and morale. Again, this question allows candidates to distinguish themselves in the core competence of any Scrum Master.

Question 79: Balancing Demands With Principles

Please explain how you have balanced stakeholder demands with Agile principles to help the Scrum team prioritize work effectively.

Background: This question seeks insights into the candidate’s ability to support the Scrum team in general and the Product Owner in particular in navigating competing demands, aligning stakeholder expectations with Agile principles to focus the team’s efforts on the most impactful work from the customers’ perception and the organization’s perspective.

Acceptable Answer: The candidate should provide an example of supporting the Product Owner by employing prioritization techniques, such as User Story Mapping, in collaboration with stakeholders to align on priorities that offer the most value, leading to the creation of valuable Product Goals and roadmaps in the process. They should highlight their negotiation skills, ability to facilitate consensus, and adeptness at transparent communication to manage expectations and maintain a sustainable pace for the team.

Question 80: Boring Projects and Motivation

How do you sustain team motivation and engagement in long-term projects with high levels of task repetition?

Background: This question explores the candidate’s strategies for keeping the team engaged and motivated through the monotony of prolonged projects or repetitive tasks. While we would all like to work on cutting-edge technology all the time, everyday operations often comprise work that we consider less glamorous yet grudgingly accept as valuable, too. The question gauges a candidate’s ability to uphold enthusiasm and maintain high performance in a potentially less motivating environment.

Acceptable Answer: Expect the candidate to discuss innovative approaches like introducing gamification elements for mundane tasks, rotating roles within the team to provide fresh challenges, and setting up regular skill-enhancement workshops. They might also mention the importance of celebrating small wins, giving recognition, for example, with Kudo cards, and ensuring that the team’s work aligns with individual growth goals. The response should underline their commitment to maintaining a positive and stimulating work environment, even under challenging circumstances.

Question 81: Onboarding New Team Members

Please describe your experience integrating a new team member into an established Scrum team, ensuring a seamless transition and maintaining team productivity.

Background: This question assesses the candidate’s approach to onboarding new team members in a way that minimizes disruption and maximizes integration speed.
This approach is critical for maintaining an existing team’s cohesive and productive dynamics, acknowledging that Scrum teams will regularly change composition.

Acceptable Answer: Look for answers detailing a structured and inclusive onboarding plan that includes, for example:

- Mentorship programs
- A buddy system
- Clear documentation of team norms and expectations, such as a working agreement and a Definition of Done
- Team activities
- Gradual immersion into the Scrum team’s projects through pair programming or shadowing

The candidate should highlight the importance of fostering an inclusive team culture that welcomes questions and supports new members in their learning journey, ensuring they feel valued and part of the team from day one.

Question 82: Conflict Resolution

How do you approach conflict resolution within a Scrum team or between the team and stakeholders to ensure continued progress and collaboration?

Background: Conflicts are inevitable in any team dynamic. This question probes the candidate’s skills in navigating and resolving disagreements in a way that strengthens the team and stakeholder relationships rather than undermining them.

Acceptable Answer: The candidate should describe their ability to act as a neutral mediator, actively listen to understand all perspectives, and facilitate problem-solving sessions focusing on interests rather than positions. They might also discuss creating forums for open dialogue, such as conflict-themed Retrospectives, and the importance of fostering a culture of trust and psychological safety where conflicts can be aired constructively. The response should convey their adeptness at turning conflicts into opportunities for growth and deeper understanding. However, the candidate should also make clear that not all disputes among team members may be solvable and that, once all team-based options have been exhausted, the Scrum Master needs to ask for management support to bring the conflict to a conclusion.

Question 83: Scaling Scrum?

Please reflect on a time when scaling Scrum across multiple teams presented significant challenges. How did you address these challenges to ensure the organization’s success with its Agile transformation?

Background: Scaling Agile practices is a complex endeavor that can highlight organizational impediments and resistance. This question delves into the candidate’s experience in successfully scaling Scrum, ensuring alignment and cohesion among multiple teams, and helping everyone see the value in a transformation.

Acceptable Answer: This open question allows candidates to address their familiarity with frameworks like LeSS or Nexus, or to share their opinion on whether SAFe is useful. Moreover, at a philosophical level, it opens the discussion of whether “Agile” is scalable at all, given that most scaling frameworks apply more process to the issue. The opposing opinion points to the need to descale the organization instead, by empowering those closest to the problems to decide within the given constraints and governance rules. The candidate should emphasize the importance of maintaining a shared vision and goals, creating communities of practice to share knowledge and best practices, and addressing cultural barriers to change. They should also reflect on the importance of executive sponsorship, the strategic engagement of key stakeholders to champion and support the scaling effort, and the necessity of a failure culture.
How To Use the Scrum Master Interview Questions

Scrum has always been a hands-on business, and to be successful at it, a candidate needs a passion for getting her hands dirty. While the basic rules are trivial, getting a group of individuals with different backgrounds, levels of engagement, and personal agendas to form and perform as a team is a complex task. (As always, you might say, when humans and communication are involved.) Moreover, the larger the organization, the more management levels there are, and the more likely failure is lurking around the corner. The questions are not necessarily suited to turning an inexperienced interviewer into an agile expert. But in the hands of a seasoned practitioner, they can help determine which candidates have worked in the agile trenches in the past.
Alternative Text: This comic depicts an interaction between two characters and is split into four panes. In the upper-left pane, Character 1 enters the scene with a slightly agitated expression and comments to Character 2, "Your PR makes SQL injection possible!" Character 2, who is typing away at their computer, responds happily, "Wow, that wasn't even my intention," as if Character 1 has paid them a compliment. In the upper-right pane, Character 1, now with an increasingly agitated expression, says, "I mean, your code is vulnerable." Character 2, now standing and facing Character 1, is almost proudly embarrassed at what they take as positive feedback and replies, "Stop praising me, I get shy." In the lower-left pane, Character 1, now shown with sharp teeth and a scowl, points a finger at Character 2 and shouts, "Vulnerable is bad!" Character 2 seems shocked at this statement, standing with their mouth and eyes wide open. In the lower-right and final pane of the comic, Character 2, smiling once again, replies with the comment, "At least it can do SQL injection!" Character 1 stares back at Character 2 with a blank expression.
People initially became interested in blockchain several years ago after learning about it as a decentralized digital ledger. It supports transparency because no one can change information once it is stored on the chain. People can also watch transactions as they happen, further enhancing visibility. But how does blockchain support the integrity of cloud-stored data?

3 Ways Blockchain Supports the Integrity of Cloud-Stored Data

1. Protecting and Facilitating the Sharing of Medical Records

Technological advancements have undoubtedly improved the ease of sharing medical records between providers. When patients go to new healthcare facilities, all involved parties can easily see those individuals’ histories, treatments, test results, and more. Such records keep everyone updated about what has happened to patients, which significantly reduces the likelihood of redundancies and confusion that could extend a health management timeline.

Cloud computing has also accelerated information-sharing efforts within healthcare and other industries. It allows medical professionals to access and collaborate through scalable platforms. Many healthcare workers also appreciate that they can access cloud apps from anywhere. That convenience supports physicians who must travel for continuing medical education events, travel nurses, surgeons who split their time between multiple hospitals, and others who often work from numerous locations.

However, despite these cloud computing benefits, a security-related downside is that these platforms use a centralized infrastructure to allow record sharing across users. That characteristic leaves cloud tools open to data breaches. In one case, researchers proposed addressing this shortcoming with a blockchain architecture to authenticate users and enable opportunities for sharing medical records securely. The group prioritized blockchain due to its immutability while seeking to create a system that allowed patients and their providers to share and store medical records securely. The researchers also wanted to design something that was not at risk of data loss or other failures.

The researchers implemented so-called “special recognition keys” to identify medical-related parties, such as doctors, patients, and hospitals. When testing their system, the metrics studied included the time to complete a transaction and how well the communication-related attributes performed. The outcomes suggested the researchers’ approach worked far better than existing solutions.

2. Improving Access Control

Data breaches can be costly, catastrophic events. Although there is no single solution for preventing them, people can make meaningful progress by focusing on access control. One of the most convenient things about the cloud is that it allows all authorized users to access content regardless of their location. However, as the number of people engaging with a cloud platform increases, so does the risk of compromised credentials that could allow hackers to enter networks and wreak havoc.

Many corporate leaders have prioritized cloud-first strategies. That approach can strengthen cybersecurity because service providers have numerous security features to supplement internal measures. Additionally, cloud-based backup capabilities facilitate faster data recovery if cyberattacks occur. However, research suggests some access control practices used by cloud administrators have significant shortcomings that could make cyberattacks more likely.
For example, one study about access management for cloud platforms found that 49% of administrators store passwords in a spreadsheet. That is a huge security risk for many reasons, and it highlights the need for better password hygiene practices. Fortunately, blockchain is well positioned to help solve this problem. In one example, researchers developed a blockchain system that uses attribute-based encryption to improve how cloud users access content. The setup also contains an audit contract that dynamically manages who can use the cloud and when.

The team built a fine-grained, searchable system that maintained access control, strengthening cloud security and achieving the desired results without excessive computing power. Results also showed this system increased storage capacity. When the group performed a security analysis on their blockchain creation, they found it stopped chosen-plaintext attacks and breaches based on guessed keywords. A theoretical examination and associated experiments suggested the tool worked better, from a computing power and storage efficiency perspective, than comparable alternatives.

3. Curbing Emerging Technologies’ Potential Threats

Even as new technologies show tremendous progress and excite people about the future, some individuals specifically investigate how they could harm others through technological advancements. Developments associated with ChatGPT and other generative AI tools are excellent examples. These chatbots can save people time by assisting with tasks such as idea generation or outline creation. However, because these tools create believable-sounding paragraphs in seconds, some cybercriminals use generative artificial intelligence (genAI) chatbots to write phishing emails much faster than before. It is easy to imagine the ramifications of a cybercriminal who writes a convincing phishing message and uses it to access someone’s cloud-stored information.

ChatGPT runs on a cloud platform built by OpenAI, the company that created the chatbot. A lesser-known issue affecting data integrity is that OpenAI uses interactions with the tool to train future versions of the algorithms. People can opt out of having their conversations become part of the training data, but many haven’t done so or don’t know how. As workers eagerly tested ChatGPT and similar tools, some committed potential security breaches without realizing it. Consider a web developer who enters a proprietary code string into ChatGPT and asks the tool for help debugging it. That seemingly minor decision could result in sensitive information becoming part of the training data and no longer being carefully protected by the developer’s employer.

Some leaders quickly established rules for appropriate usage or banned generative AI tools to address these threats. Even so, a February 2024 study showed some workers kept entering sensitive information into ChatGPT despite knowing the associated risks. It is still unclear how blockchain will support data integrity for people using cloud-based generative AI tools, but many professionals are upbeat about the potential.

Conclusion: Using Blockchain for Cloud Data Protection

Entities ranging from government agencies to e-commerce stores use cloud platforms daily. These options are incredibly convenient because they eliminate geographical barriers and allow people to use them through an active internet connection anywhere in the world.
However, many cloud tools store sensitive data, such as health records or payment details. Since cloud platforms hold such a wealth of information, hackers will likely continue targeting them. Although most cloud providers have built-in security features, cybercriminals continually seek ways to circumvent such protections. The examples here show why the blockchain is an excellent candidate for much-needed additional safeguards.
The essential mathematics for artificial intelligence (AI) and quantum computing is foundational to understanding and advancing these cutting-edge fields. In AI, concepts like linear algebra, calculus, probability theory, and optimization are pivotal for modeling data, training machine learning algorithms, and making predictions. Similarly, in quantum computing, these mathematical pillars are indispensable for representing quantum states, designing quantum algorithms, and analyzing quantum phenomena. Whether it's optimizing neural networks or harnessing the power of quantum superposition, a solid grasp of these mathematical principles is crucial for pushing the boundaries of artificial intelligence and quantum computing alike.

Complex Numbers

Complex numbers, which consist of a real and an imaginary part (a + ib), together with complex arithmetic and functions, are fundamental to quantum mechanics. They allow for the representation of quantum states and the mathematical operations performed on them. In AI, complex numbers have also found applications in areas like neural networks and signal processing.

[Image: A complex number]

Linear Algebra

Linear algebra, including concepts like vectors, matrices, linear transformations, and eigenvalues/eigenvectors, is crucial for both quantum computing and many AI techniques. It provides the mathematical framework for representing and manipulating the states and operators of quantum systems, as well as the data structures and algorithms used in AI.

Calculus and Optimization

Calculus and optimization are crucial for training and tuning AI models, as well as for understanding the dynamics of quantum systems. The key concepts requiring a basic understanding are differentiation and integration, gradient-based optimization techniques, and variational methods. Additionally, a good understanding of convex optimization helps in the context of optimization algorithms and loss minimization. Refer to Convex Optimization by Boyd and Vandenberghe.

[Image: Mathematics for AI and Quantum]

Hilbert Spaces

Quantum mechanics utilizes the mathematical structure of Hilbert spaces, which generalize the concepts of vectors and linear algebra to infinite dimensions. This allows for the representation of quantum states as vectors in a Hilbert space. Some AI models, such as those based on kernel methods, also make use of Hilbert space structures.

Probability and Statistics

Both quantum computing and AI rely heavily on probability theory and statistical methods. Quantum mechanics describes the probabilistic nature of measurements, while many AI algorithms, like Bayesian networks and reinforcement learning, are built on probabilistic foundations.

Group Theory and Representation Theory

Symmetry groups, unitary transformations, and irreducible representations are advanced mathematical concepts that are important for understanding the underlying structure of quantum systems and some quantum algorithms.
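As a compact illustration of how several of these areas meet (complex numbers, linear algebra, Hilbert spaces, and probability), consider a single qubit, written in standard notation:

\[
|\psi\rangle = \alpha\,|0\rangle + \beta\,|1\rangle,
\qquad \alpha, \beta \in \mathbb{C},
\qquad |\alpha|^2 + |\beta|^2 = 1.
\]

The state is a unit vector in the Hilbert space \(\mathbb{C}^2\); measuring in the computational basis yields 0 with probability \(|\alpha|^2\) and 1 with probability \(|\beta|^2\). Quantum gates are unitary matrices acting on this vector; for example, the Hadamard gate creates the superposition used in many quantum algorithms:

\[
H = \frac{1}{\sqrt{2}}
\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix},
\qquad
H\,|0\rangle = \frac{|0\rangle + |1\rangle}{\sqrt{2}}.
\]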
Conclusion

While the depth of understanding required may vary, a solid grasp of these core mathematical areas is essential both for advancing AI, including deep learning, and for developing quantum computing technologies. The essential mathematics for AI and quantum computing share several key concepts. Linear algebra serves as a cornerstone, enabling the representation of data and quantum states through vectors and matrices. Probability theory underpins both fields, facilitating the understanding of uncertainty in AI models and the probabilistic nature of quantum phenomena. Optimization techniques play a vital role in training machine learning models and optimizing quantum algorithms. Additionally, concepts from calculus provide the mathematical framework for gradient-based optimization and understanding quantum dynamics. Together, these mathematical foundations form the basis for advancing research and innovation in both AI and quantum computing domains.
Wireshark, the free, open-source packet sniffer and network protocol analyzer, has cemented itself as an indispensable tool in network troubleshooting, analysis, and security (on both sides). This article delves into the features, uses, and practical tips for harnessing the full potential of Wireshark, expanding on aspects that may have been glossed over in discussions or demonstrations. Whether you're a developer, security expert, or just curious about network operations, this guide will enhance your understanding of Wireshark and its applications.

Introduction to Wireshark

Wireshark, initially developed by Gerald Combs, is designed to capture and analyze network packets in real time. Its capabilities extend across various network interfaces and protocols, making it a versatile tool for anyone involved in networking. Unlike its command-line counterpart, tcpdump, Wireshark's graphical interface simplifies the analysis process, presenting data in a user-friendly "proto view" that organizes packets in a hierarchical structure. This facilitates quick identification of protocols, ports, and data flows. The key features of Wireshark are:

- Graphical user interface (GUI): Eases the analysis of network packets compared to command-line tools
- Proto view: Displays packet data in a tree structure, simplifying protocol and port identification
- Compatibility: Supports a wide range of network interfaces and protocols

Browser Network Monitors

Firefox and Chrome contain a far superior network monitor tool built into them. It is superior because it is simpler to use and works with secure websites out of the box. If you can use the browser to debug the network traffic, you should do that. In cases where your traffic requires low-level protocol information or originates outside of the browser, Wireshark is the next best thing.

Installation and Getting Started

To begin with Wireshark, visit the official website for the download. The installation process is straightforward, but attention should be paid to the installation of the command-line tools, which may require separate steps. Upon launching Wireshark, users are greeted with a selection of network interfaces. Choosing the correct interface is crucial for capturing relevant data: when debugging a local server (localhost), use the loopback interface; remote servers will usually correspond to the en0 network adapter. You can use the activity graph next to each network adapter to identify active interfaces for capture.

Navigating Through Noise With Filters

One of the challenges of using Wireshark is the overwhelming amount of data captured, including irrelevant "background noise." Wireshark addresses this with powerful display filters, allowing users to hone in on specific ports, protocols, or data types. For instance, filtering TCP traffic on port 8080 can significantly reduce unrelated data, making it easier to debug specific issues. A completion widget at the top of the Wireshark UI helps you discover filter field names and values. In this case, we filter by port with tcp.port == 8080, the port typically used by Java servers (e.g., Spring Boot/Tomcat). But this isn't always enough, as HTTP is more concise: adding http to the filter narrows the view to HTTP requests and responses only.
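For orientation, here are a few more display filter expressions of the same kind, in standard Wireshark display filter syntax; the parenthetical notes are descriptions, not part of the filter:

tcp.port == 8080              (TCP traffic to or from port 8080)
http                          (only HTTP requests and responses)
http && tcp.port == 8080      (both conditions combined)
ip.addr == 192.168.1.10       (traffic to or from a specific host)
tcp.flags.syn == 1            (TCP connection attempts)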
Deep Dive Into Data Analysis

Wireshark excels in its ability to dissect and present network data in an accessible manner. For example, HTTP responses carrying JSON data are automatically parsed and displayed in a readable tree structure within the packet analysis pane. This feature is invaluable for developers and analysts, providing insights into the data exchanged between clients and servers without manual decoding. Wireshark also offers both hexadecimal and ASCII views of the raw packet data.

Beyond Basic Usage

While Wireshark's basic functionalities cater to a wide range of networking tasks, its true strength lies in advanced features such as Ethernet network analysis, HTTPS decryption, and debugging across devices. These tasks, however, may involve complex configuration steps and a deeper understanding of network protocols and security measures. There are two big challenges when working with Wireshark:

- HTTPS decryption: Decrypting HTTPS traffic requires additional configuration but offers visibility into secure communications.
- Device debugging: Wireshark can be used to troubleshoot network issues on various devices, requiring specific knowledge of network configurations.

The Basics of HTTPS Encryption

HTTPS uses Transport Layer Security (TLS) or its predecessor, Secure Sockets Layer (SSL), to encrypt data. This encryption mechanism ensures that any data transferred between the web server and the browser remains confidential and unaltered. The process involves a series of steps including the handshake, data encryption, and data integrity checks. Decrypting HTTPS traffic is often necessary for developers and network administrators to troubleshoot communication errors, analyze application performance, or ensure that sensitive data is correctly encrypted before transmission. It is a powerful capability for diagnosing complex issues that cannot be resolved by simply inspecting unencrypted traffic or server logs.

Methods for Decrypting HTTPS in Wireshark

Important: Decrypting HTTPS traffic should only be done on networks and systems you own or have explicit permission to analyze. Unauthorized decryption of network traffic can violate privacy laws and ethical standards.

Pre-Master Secret Key Logging

One common method involves using the pre-master secret keys to decrypt HTTPS traffic. Browsers like Firefox and Chrome can log the pre-master secret keys to a file when configured to do so. Wireshark can then use this file to decrypt the traffic:

1. Configure the browser: Set an environment variable (SSLKEYLOGFILE) to specify a file where the browser will save the encryption keys.
2. Capture traffic: Use Wireshark to capture the traffic as usual.
3. Decrypt the traffic: Point Wireshark to the file with the pre-master secret keys (through Wireshark's preferences) to decrypt the captured HTTPS traffic.

Using a Proxy

Another approach involves routing traffic through a proxy server that decrypts HTTPS traffic and then re-encrypts it before sending it to the destination. This method might require setting up a dedicated decryption proxy that can handle the TLS encryption/decryption:

1. Set up a decryption proxy: Tools like mitmproxy or Burp Suite can act as an intermediary that decrypts and logs HTTPS traffic.
2. Configure the network to route through the proxy: Ensure the client's network settings route traffic through the proxy.
3. Inspect traffic: Use the proxy's tools to inspect the decrypted traffic directly.
Integrating tcpdump With Wireshark for Enhanced Network Analysis

While Wireshark offers a graphical interface for analyzing network packets, there are scenarios where using it directly may not be feasible due to security policies or operational constraints. tcpdump, a powerful command-line packet analyzer, becomes invaluable in these situations, providing a flexible and less intrusive means of capturing network traffic.

The Role of tcpdump in Network Troubleshooting

tcpdump allows for the capture of network packets without a graphical user interface, making it ideal for use in environments with strict security requirements or limited resources. It operates on the principle of capturing network traffic to a file, which can then be analyzed at a later time or on a different machine using Wireshark.

Key Scenarios for tcpdump Usage

- High-security environments: In places like banks or government institutions where running network sniffers might pose a security risk, tcpdump offers a less intrusive alternative.
- Remote servers: Debugging issues on a cloud server can be challenging with Wireshark due to the graphical interface; tcpdump captures can be transferred and analyzed locally.
- Security-conscious customers: Customers may be hesitant to allow third-party tools to run on their systems; tcpdump's command-line operation is often more palatable.

Using tcpdump Effectively

Capturing traffic with tcpdump involves specifying the network interface and an output file for the capture. This process is straightforward but powerful, allowing for detailed analysis of network interactions:

- Command syntax: The basic command structure for initiating a capture involves specifying the network interface (e.g., en0 for wireless connections) and the output file name.
- Execution: Once the command is run, tcpdump silently captures network packets. The capture continues until it is manually stopped, at which point the captured data is saved to the specified file.
- Opening captures in Wireshark: The file generated by tcpdump can be opened in Wireshark for detailed analysis, utilizing Wireshark's advanced features for dissecting and understanding network traffic.

The following shows the tcpdump command and its output:

$ sudo tcpdump -i en0 -w output
Password:
tcpdump: listening on en0, link-type EN10MB (Ethernet), capture size 262144 bytes
^C
3845 packets captured
4189 packets received by filter
0 packets dropped by kernel

Challenges and Considerations

Identifying the correct network interface for capture on remote systems might require additional steps, such as using the ifconfig command to list available interfaces. This step is crucial for ensuring that relevant traffic is captured for analysis.

Final Word

Wireshark stands out as a powerful tool for network analysis, offering deep insights into network traffic and protocols. Whether it's for low-level networking work, security analysis, or application development, Wireshark's features and capabilities make it an essential tool in the tech arsenal. With practice and exploration, users can leverage Wireshark to uncover detailed information about their networks, troubleshoot complex issues, and secure their environments more effectively. Wireshark's blend of ease of use with profound analytical depth ensures it remains a go-to solution for networking professionals across the spectrum. Its continuous development and wide-ranging applicability underscore its position as a cornerstone in the field of network analysis.
Combining tcpdump's capabilities for capturing network traffic with Wireshark's analytical prowess offers a comprehensive solution for network troubleshooting and analysis. This combination is particularly useful in environments where direct use of Wireshark is not possible or ideal. While both tools have a steep learning curve due to their powerful and complex features, they collectively form an indispensable toolkit for network administrators, security professionals, and developers alike. This integrated approach not only addresses the challenges of capturing and analyzing network traffic in various operational contexts but also highlights the versatility and depth of the tools available for understanding and securing modern networks.
Executive engineers are crucial in directing a technology-driven organization's strategic direction and technological innovation. As a staff engineer, it is essential to understand the significance of executive engineering. It goes beyond recognizing the hierarchy within an engineering department to appreciating the profound impact these roles have on individual contributors' day-to-day technical work and long-term career development. Staff engineers are deep technical experts who focus on solving complex technical challenges and defining architectural pathways for projects. However, their success is closely linked to the broader engineering strategy set by the executive team. This strategy determines staff engineers' priorities, technologies, and methodologies. Therefore, alignment between executive decisions and technical implementation is essential for the engineering team to function effectively and efficiently. Executive engineers, such as Chief Technology Officers (CTOs) and Vice Presidents (VPs) of Engineering, do more than provide technical oversight; they embody the bridge between cutting-edge engineering practices and business outcomes. They are tasked with anticipating technological trends and aligning them with the business's needs and market demands. In doing so, they ensure that the engineering teams are not just functional but are proactive agents of innovation and growth. For staff engineers, the strategies and decisions made at the executive level deeply influence their work environment, the tools they use, the scope of their projects, and their approach to innovation. Thus, understanding and engaging with executive engineering is essential for staff engineers who aspire to contribute significantly to their organizations and potentially advance into leadership roles. In this dynamic, the relationship between staff and executive engineers becomes a critical axis around which much of the company's success revolves. This introduction explores why executive engineering is vital from the staff engineer's perspective and how it shapes an organization's technological and operational landscape. Hierarchical Structure of Engineering Roles In the hierarchical structure of engineering roles, understanding the unique responsibilities and contributions of each position (staff engineer, engineering manager, and engineering executive) is crucial for effective career progression and organizational success. Staff Engineers are primarily responsible for high-level technical problem-solving and creating architectural blueprints. They guide projects technically but usually do not directly manage people. Engineering Managers oversee teams, focusing on managing personnel and ensuring that projects align with the organizational goals. They act as the bridge between the technical team and the broader business objectives. Engineering Executives, such as CTOs or VPs of Engineering, shape the strategic vision of the technology department and ensure its alignment with the company's overarching goals. They are responsible for high-level decisions about the direction of technology and infrastructure, often dealing with cross-departmental coordination and external business concerns. The connection between a staff engineer and an engineering executive is pivotal in crafting and executing an effective strategy. While executives set the strategic direction, staff engineers are instrumental in grounding this strategy with their deep technical expertise and practical insights.
This collaboration ensures that strategic initiatives are both visionary and technically feasible, enabling the organization to innovate while maintaining robust operational standards. The Engineering Executive's Primer: Impactful Technical Leadership Will Larson's book, The Engineering Executive's Primer: Impactful Technical Leadership, is an essential guide for those aspiring to or currently in engineering leadership roles. With his extensive experience as a CTO, Larson offers a roadmap from securing an executive position to mastering the complexities of technical and strategic leadership in engineering. Key Insights From the Book Transitioning to Leadership Larson discusses the nuances of obtaining an engineering executive role, from negotiation to the critical first steps post-hire. This guidance is vital for engineers transitioning from technical to executive positions, helping them avoid common pitfalls. Strategic Planning and Communication The book outlines how to run engineering planning processes effectively and maintain clear organizational communication. These skills are essential for aligning various engineering activities with company goals and facilitating inter-departmental collaboration. Operational Excellence Larson delves into managing crucial meetings, performance management systems, and the strategic hiring and onboarding of new engineers. These processes are fundamental to maintaining a productive engineering team and fostering a high-performance culture. Personal Management Another focus of the book, often overlooked in technical fields, is the importance of managing one's own priorities and energy. Larson provides strategies for staying effective and resilient in the face of challenges. Navigational Tools for Executive Challenges From mergers and acquisitions to interacting with CEOs and peer executives, the book provides insights into the broader corporate interactions an engineering executive will navigate. Conclusion The engineering executive's role is pivotal in setting a vision that integrates with the organization's strategic objectives. Still, it is the symbiotic relationship with staff engineers that brings this vision to fruition. Larson's The Engineering Executive's Primer is an invaluable resource for engineers at all levels, especially those aiming to bridge the gap between deep technical expertise and impactful leadership. Through this primer, engineering leaders can learn to manage, inspire, and drive technological innovation within their companies.
In today's digital world, mobile apps play a crucial role in our daily lives. They serve a range of purposes, from transactions and online shopping to social interactions and work efficiency, making them essential. However, with their widespread use comes an increased risk of security threats. Ensuring the security of an app requires a comprehensive approach, from secure development methods to continuous monitoring. Prioritizing security is key to safeguarding your users and upholding the trustworthiness of your app. Remember, security is an ongoing responsibility rather than a one-time task: stay updated on emerging risks and adjust your security strategies accordingly. The following sections discuss the importance of security measures and outline the steps for developing a secure mobile app. What Is Mobile App Security and Why Does It Matter? Mobile app security involves practices and precautions to shield apps from vulnerabilities, attacks, and unauthorized entry. It encompasses elements such as data safeguarding, authentication processes, authorization mechanisms, secure coding principles, and encryption techniques. The Significance of Ensuring Mobile App Security User Trust: Users expect their personal information to be kept safe when using apps. A breach damages both trust and reputation. Compliance With Laws and Regulations: Most countries have data protection laws, such as GDPR, which organizations are required to adhere to. Not following these regulations can result in penalties. Financial Consequences: Security breaches can lead to financial losses from remediation costs, compensation, and recovery efforts. Sustaining Business Operations: A compromised app has the potential to disrupt business functions and affect revenue streams. Guidelines for Developing a Secure Mobile App Creating a secure application entails several crucial steps aimed at fortifying the app against possible security risks. The following is a detailed roadmap for constructing such an app. 1. Recognize and Establish Security Requirements Prior to commencing development, outline the security prerequisites specific to your app. Take into account aspects like authentication, data storage, encryption, and access management. 2. Choose a Reliable Cloud Platform Choose a cloud service provider that offers robust security features. Popular choices include Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). 3. Ensure Safe Development Practices • Educate developers on secure coding methods to steer clear of vulnerabilities such as SQL injection, cross-site scripting (XSS), and insecure APIs. • Conduct routine code reviews to detect security weaknesses at an early stage. 4. Implement Authentication and Authorization Measures • Employ robust authentication methods like multi-factor authentication (MFA) for heightened user login security. • Utilize Role-Based Access Control (RBAC) to assign permissions based on user roles, limiting access to sensitive functionality. 5. Safeguard Data Through Encryption (see the code sketch after this list) • Utilize HTTPS for communication between the application and server for in-transit encryption. • Encrypt sensitive data stored in databases or files for at-rest encryption. 6. Ensure the Security of APIs • Validate all input, employ API keys, and set up rate limiting for API security. • Securely handle user authentication and authorization with the OAuth and OpenID Connect protocols. 7. Conduct Regular Security Assessments • Perform penetration testing periodically to identify vulnerabilities. • Leverage automated scanning tools to detect security issues efficiently. 8. Monitor Activities and Respond to Incidents • Keep track of app behavior in real time to spot any irregularities or anomalies promptly. • Having a plan for handling security incidents is crucial.
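To make step 5 concrete, below is a minimal sketch of at-rest encryption using AES-GCM via the standard javax.crypto API. It is illustrative only: generating a throwaway key in memory is a simplification, and in a real app the key should come from a platform keystore (for example, the Android Keystore) rather than being created on the fly.

Java
import javax.crypto.Cipher;
import javax.crypto.KeyGenerator;
import javax.crypto.SecretKey;
import javax.crypto.spec.GCMParameterSpec;
import java.nio.charset.StandardCharsets;
import java.security.SecureRandom;

public class AtRestEncryptionSketch {
    private static final int GCM_TAG_BITS = 128; // authentication tag length
    private static final int IV_BYTES = 12;      // recommended IV size for GCM

    public static void main(String[] args) throws Exception {
        // Simplification: in production, load the key from a secure keystore instead.
        KeyGenerator keyGen = KeyGenerator.getInstance("AES");
        keyGen.init(256);
        SecretKey key = keyGen.generateKey();

        // A fresh, random IV must be used for every encryption operation.
        byte[] iv = new byte[IV_BYTES];
        new SecureRandom().nextBytes(iv);

        Cipher cipher = Cipher.getInstance("AES/GCM/NoPadding");
        cipher.init(Cipher.ENCRYPT_MODE, key, new GCMParameterSpec(GCM_TAG_BITS, iv));
        byte[] ciphertext = cipher.doFinal("sensitive user data".getBytes(StandardCharsets.UTF_8));

        // Decryption requires the same key and IV; GCM also verifies integrity.
        cipher.init(Cipher.DECRYPT_MODE, key, new GCMParameterSpec(GCM_TAG_BITS, iv));
        System.out.println(new String(cipher.doFinal(ciphertext), StandardCharsets.UTF_8));
    }
}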
What Is Involved in Mobile Application Security Testing? Implementing robust security testing methods is crucial for ensuring the integrity and resilience of mobile applications. Static Application Security Testing (SAST), Dynamic Application Security Testing (DAST), and Mobile App Penetration Testing are fundamental approaches that help developers identify and address security vulnerabilities. These methodologies not only fortify the security posture of apps but also contribute to maintaining user trust and confidence. Let's delve deeper into each of these testing techniques to understand their significance in securing mobile apps effectively. Static Application Security Testing (SAST) This method involves identifying security vulnerabilities in applications during the development stage. It entails examining the application's source code or binary without executing it, which helps detect security flaws early in the development process. SAST scans the codebase for vulnerabilities like injection flaws, broken authentication, insecure data storage, and other typical security issues. Automated scanning tools are used to analyze the code and pinpoint problems such as hardcoded credentials, improper input validation, and exposure of sensitive data. By detecting security weaknesses before deployment, SAST allows developers to make the necessary improvements to enhance the application's security stance. Integrating SAST into the development workflow also aids in meeting industry standards and regulatory mandates. In essence, SAST strengthens mobile application resilience against cyber threats, protecting information and upholding user confidence in today's interconnected environment. Dynamic Application Security Testing (DAST) This method is used to test the security of apps while they are running, assessing their security in real time. Unlike static analysis, which looks at the app's source code, DAST evaluates how the app behaves in a running environment. DAST tools emulate real-world attacks by interacting with the app as a user would, sending different inputs and observing the reactions. By analyzing how the app operates during runtime, DAST can pinpoint security issues such as injection vulnerabilities, weak authentication measures, and improper error handling. DAST mainly focuses on uncovering vulnerabilities that may not be obvious from examining the code. Some common techniques used in DAST include fuzz testing, where the app is bombarded with unexpected inputs to reveal vulnerabilities, and penetration testing conducted by ethical hackers attempting to exploit security flaws. By using DAST, developers can detect vulnerabilities that malicious actors could exploit to compromise the confidentiality, integrity, or availability of an app's data. Integrating DAST into mobile app development allows developers to find and fix security weaknesses before deployment, thereby reducing the chances of security breaches and strengthening application security. Mobile App Penetration Testing This proactive approach is employed to pinpoint weaknesses and vulnerabilities in apps. It simulates real-world attacks to assess the security stance of an application and its underlying infrastructure. Penetration tests can be conducted manually by cybersecurity experts or automated using specialized tools and software.
The testing procedure includes several phases: Reconnaissance: Gather details about the application's structure, features, and possible attack paths. Vulnerability Scanning: Use automated tools to pinpoint security vulnerabilities in the app. Exploitation: Attempt to exploit identified vulnerabilities to gain entry or elevate privileges. Post-Exploitation: Document the consequences of breaches and offer recommendations for mitigation. Mobile App Penetration Testing helps organizations uncover and rectify security weaknesses and reduces the risk of data breaches, financial harm, and damage to reputation. By evaluating the security of their apps, companies can enhance their security standing and maintain the confidence of their clients. By combining the above methodologies, Mobile App Security Testing helps identify and rectify security vulnerabilities during the development process, ensuring that mobile apps are strong, resilient, and protected against cybersecurity risks. This helps safeguard user data and maintain user trust in today's interconnected world. Common Mobile App Security Threats Data Leakage Data leakage refers to the unauthorized exposure of sensitive information stored or transmitted via mobile apps. This poses significant risks for both individuals and companies, including identity theft, financial scams, damage to reputation, and legal ramifications. For individuals, data leaks can compromise details such as names, addresses, social security numbers, and financial information, impacting their privacy and security. Moreover, leaks of health or personal data can harm someone's reputation and well-being. On the business front, data leaks can result in financial losses, regulatory fines, and erosion of customer trust. Breaches involving customer data can harm a company's image, leading to customer loss, which can affect revenue and competitiveness. Failure to secure sensitive information can also lead to severe consequences and penalties, especially in regulated industries like healthcare, finance, or e-commerce. Therefore, implementing robust security measures is crucial to protect information and maintain user trust in mobile apps. Man-in-the-Middle (MITM) Attacks Man-in-the-middle (MITM) attacks happen when someone secretly intercepts and alters communication between two parties. In the context of apps, this involves a hacker inserting themselves between a user's device and the server, allowing them to spy on shared information. MITM attacks are risky, potentially leading to data theft and identity fraud, as hackers can access login credentials, financial transactions, and personal data. To prevent MITM attacks, developers should use encryption methods such as HTTPS/TLS, while users should avoid public Wi-Fi networks and consider using VPNs for added security. Remaining vigilant and taking precautions are essential in protecting against MITM attacks. Injection Attacks Injection attacks pose significant security risks to apps as malicious actors exploit vulnerabilities to insert and execute unauthorized code. Common examples include SQL injection and JavaScript injection. During these attacks, perpetrators tamper with input fields to inject commands, gaining unauthorized access to data or disrupting app functions. Injection attacks can lead to data breaches, data tampering, and system compromise. To prevent these attacks, developers should enforce input validation, use parameterized queries, and adhere to secure coding practices.
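As an illustration of parameterized queries, here is a minimal JDBC sketch; the JDBC URL, table, and column names are hypothetical. The user-supplied value is bound as a parameter rather than concatenated into the SQL string, so it can never be interpreted as SQL code:

Java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;

public class ParameterizedQuerySketch {
    public static void main(String[] args) throws Exception {
        // Hostile input: with naive string concatenation this would alter the SQL.
        String userInput = "alice'; DROP TABLE accounts; --";
        try (Connection conn = DriverManager.getConnection("jdbc:sqlite:app.db");
             // The '?' placeholder keeps data and SQL strictly separate.
             PreparedStatement stmt = conn.prepareStatement(
                     "SELECT id, balance FROM accounts WHERE username = ?")) {
            stmt.setString(1, userInput); // bound as a value, never executed as SQL
            try (ResultSet rs = stmt.executeQuery()) {
                while (rs.next()) {
                    System.out.println(rs.getLong("id") + " " + rs.getLong("balance"));
                }
            }
        }
    }
}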
Regular security assessments and tests are crucial for pinpointing and addressing vulnerabilities in apps. Insecure Authentication Insecure authentication methods can lead to vulnerabilities, opening the door to unauthorized entry and data breaches. Common issues include weak passwords, the absence of two-factor authentication, and improper session management. Cyber attackers exploit these weaknesses to impersonate users, access data unlawfully, or seize control of user accounts. A compromised authentication system jeopardizes user privacy, data integrity, and availability, posing risks to individuals and organizations. To address this risk, developers should implement security measures such as two-factor authentication and securely managed session tokens. Regular updates and enhancements to security protocols are crucial to stay ahead of evolving threats. Data Storage Ensuring secure data storage is crucial in today's technology landscape, especially for apps. It's vital to protect sensitive information and financial records to prevent unauthorized access and data breaches. Secure data storage includes encrypting information both at rest and in transit using strong encryption and secure storage techniques. Moreover, setting up access controls and authentication procedures and conducting regular security checks are essential to uphold the confidentiality and integrity of stored data. By prioritizing these data storage practices and security protocols, developers can ensure that user information remains shielded from risks and vulnerabilities. Faulty Encryption Faulty encryption and flawed security measures can lead to vulnerabilities within apps, putting sensitive data at risk of unauthorized access and misuse. If encryption algorithms are weak or not implemented correctly, encrypted data could be easily decoded by malicious actors. Poor key management, like storing encryption keys insecurely, worsens these threats. Additionally, security protocols lacking proper authentication or authorization controls create opportunities for attackers to bypass security measures. The consequences of inadequate encryption and security measures can be substantial and can include data breaches, financial losses, and a decline in user trust. To address these risks effectively, developers should prioritize strong encryption algorithms, secure key management practices, and thorough security protocols in their mobile apps. The Unauthorized Use of Device Functions The misuse of device capabilities within apps presents a security concern, putting user privacy and device security at risk. Malicious apps or attackers could exploit weaknesses to access features like the camera, microphone, or GPS without permission, leading to privacy breaches. This unauthorized access may result in covert monitoring, unauthorized audio/video recording, and location tracking, compromising user confidentiality. Additionally, unauthorized use of device functions could allow attackers to carry out activities such as sending premium SMS messages or making calls that incur costs or violate privacy. To address this issue effectively, developers should enforce strict permission controls and carefully evaluate third-party tools and integrations to prevent misuse of device capabilities. Reverse Engineering and Altering Code Reverse engineering and altering the code within apps pose serious security risks, putting the app's integrity and confidentiality at risk. Bad actors might decompile the code to find weaknesses, extract data, or alter its functions for malicious purposes.
These activities allow attackers to bypass security measures, insert malicious code, or create vulnerabilities, leading to data breaches, unauthorized access, and financial harm. Moreover, tampering with code can enable hackers to circumvent licensing terms or the protections on developers' intellectual property, impacting their revenue streams. To effectively address this threat, developers should employ techniques like code obfuscation to obscure the code's meaning and make it harder for attackers to decipher. They should also establish runtime safeguards and regularly audit the codebase for any signs of tampering or unauthorized modifications. These proactive measures help mitigate the risks associated with code alteration and maintain the app's security and integrity. Third-Party Collaborations Third-party collaborations in apps bring both advantages and risks. While connecting with third-party services can improve features and user satisfaction, it also exposes the app to security threats and privacy issues. Thoroughly evaluating third-party partners, following security protocols, and monitoring regularly are essential steps to manage these risks. Neglecting to assess third-party connections can lead to data breaches, compromised user privacy, and harm to the app's reputation. Therefore, developers should be cautious and diligent when entering into collaborations with third parties to safeguard the security and credibility of their apps. Social Manipulation Strategies Social manipulation strategies present a security risk for apps, leveraging human behavior to mislead users and jeopardize their safety. Attackers can use methods like phishing emails, deceptive phone calls, or misleading messages to trick users into sharing sensitive data such as passwords or financial information. Moreover, these tactics can influence user actions, such as clicking on malicious links or downloading apps containing malware. Such strategies erode user trust and may lead to data breaches, identity theft, or financial scams. To address this, it's important for users to understand social manipulation tactics and be cautious when dealing with suspicious requests, messages, or links in mobile apps. Developers should also incorporate security measures like two-factor authentication and anti-phishing tools to safeguard users against social engineering attacks. Conclusion Always keep in mind that security is an ongoing responsibility and not a one-time job. Stay informed about emerging threats and adapt your security measures accordingly. Developing a secure app is crucial for safeguarding user data, establishing trust, and averting security breaches.
This article is part of a series exploring a workshop guiding you through the open source project Fluent Bit, what it is, a basic installation, and setting up the first telemetry pipeline project. Learn how to manage your cloud-native data from source to destination using the telemetry pipeline phases covering collection, aggregation, transformation, and forwarding from any source to any destination. The previous article in this series saw us building our first telemetry pipelines with Fluent Bit. In this article, we continue onwards with some more specific use cases that pipelines solve. You can find more details in the accompanying workshop lab. Let's get started with this use case. Before we get started, it's important to review the phases of a telemetry pipeline: each incoming event goes from input to parser to filter to buffer to routing before being sent to its final output destination(s). For clarity in this article, we'll split up the configuration into files that are imported into a main Fluent Bit configuration file that we'll name workshop-fb.conf. Parsing Multiple Events One of the more common use cases for telemetry pipelines is having multiple event streams producing data, which creates a situation where keys are not unique unless some cleanup is done during parsing. Let's illustrate how Fluent Bit can easily provide us with a means to both parse and filter events from multiple input sources to clean up any duplicate keys before sending them onward to a destination. To provide an example, we start with an inputs.conf file containing a configuration that uses the dummy plugin to generate two types of events, both using the same key to cause confusion if we try querying without cleaning them up first:

# This entry generates a success message.
[INPUT]
    Name dummy
    Tag event.success
    Dummy {"message":"true 200 success"}

# This entry generates an error message.
[INPUT]
    Name dummy
    Tag event.error
    Dummy {"message":"false 500 error"}

Our configuration is tagging each successful event with event.success and failure events with event.error. The confusion comes from configuring the dummy message with the same key, message, for both event definitions, which makes the incoming events hard to tell apart. The file called outputs.conf contains but one destination, as shown in the following configuration:

# This entry directs all tags (it matches any we encounter)
# to print to standard output, which is our console.
#
[OUTPUT]
    Name stdout
    Match *

With our inputs and outputs configured, we can now bring them together in the single main configuration file we mentioned at the start. Let's create a new file called workshop-fb.conf in our favorite editor. Add the following configuration; for now, just importing our other two files:

# Fluent Bit main configuration file.
#
# Imports section, assumes these files are in the same
# directory as the main configuration file.
#
@INCLUDE inputs.conf
@INCLUDE outputs.conf

To see if our configuration works, we can test run it with our Fluent Bit installation. Depending on the install method chosen in the previous articles in this series, we have the option to run it from source or using a container image. First, we show how to run it using the source install, executed from the directory we created to hold all our configuration files:

# source install.
#
$ [PATH_TO]/fluent-bit --config=workshop-fb.conf

The console output should look something like this, noting that we've cut out the ASCII logo at startup:

...
[2024/04/05 16:49:33] [ info] [input:dummy:dummy.0] initializing
[2024/04/05 16:49:33] [ info] [input:dummy:dummy.0] storage_strategy='memory' (memory only)
[2024/04/05 16:49:33] [ info] [input:dummy:dummy.1] initializing
[2024/04/05 16:49:33] [ info] [input:dummy:dummy.1] storage_strategy='memory' (memory only)
[2024/04/05 16:49:33] [ info] [output:stdout:stdout.0] worker #0 started
[2024/04/05 16:49:33] [ info] [sp] stream processor started
[0] event.success: [[1712328574.915990000, {}], {"message"=>"true 200 success"}]
[0] event.error: [[1712328574.917728000, {}], {"message"=>"false 500 error"}]
[0] event.success: [[1712328575.915732000, {}], {"message"=>"true 200 success"}]
[0] event.error: [[1712328575.916608000, {}], {"message"=>"false 500 error"}]
[0] event.success: [[1712328576.915161000, {}], {"message"=>"true 200 success"}]
[0] event.error: [[1712328576.915288000, {}], {"message"=>"false 500 error"}]
...

Note the alternating generated event lines with messages that are hard to separate because they use the same key. These events keep alternating in the console until you exit with CTRL-C. Next, we show how to run our telemetry pipeline configuration using a container image. The first thing we need is a file called Buildfile. This is going to be used to build a new container image with our configuration files inserted. Note that this file needs to be in the same directory as your configuration files; otherwise, adjust the file path names:

FROM cr.fluentbit.io/fluent/fluent-bit:3.0.1

COPY ./workshop-fb.conf /fluent-bit/etc/fluent-bit.conf
COPY ./inputs.conf /fluent-bit/etc/inputs.conf
COPY ./outputs.conf /fluent-bit/etc/outputs.conf

Now we'll build a new container image as follows, using the Buildfile and naming the image with a version tag, assuming you are in the same directory (using Podman, as discussed in previous articles):

$ podman build -t workshop-fb:v4 -f Buildfile

STEP 1/4: FROM cr.fluentbit.io/fluent/fluent-bit:3.0.1
STEP 2/4: COPY ./workshop-fb.conf /fluent-bit/etc/fluent-bit.conf
--> a379e7611210
STEP 3/4: COPY ./inputs.conf /fluent-bit/etc/inputs.conf
--> f39b10d3d6d0
STEP 4/4: COPY ./outputs.conf /fluent-bit/etc/outputs.conf
COMMIT workshop-fb:v4
--> b06df84452b6
Successfully tagged localhost/workshop-fb:v4
b06df84452b6eb7a040b75a1cc4088c0739a6a4e2a8bbc2007608529576ebeba

Now, to run our new container image:

$ podman run workshop-fb:v4

The output looks exactly like the source output above, just with different timestamps. Again, you can stop the container with CTRL-C. Now we have dirty ingested data coming into our pipeline, showing that we have multiple messages on the same key. To be able to clean this up for usage before passing it on to the backend (output), we need to make use of both the Parser and Filter phases. First up is the Parser phase, where unstructured data is converted into structured data; we'll make use of the built-in regex parser plugin to structure the duplicate messages into something more usable. To set up the parser configuration, we create a new file called parsers.conf in our favorite editor.
Add the following configuration, where we are defining a PARSER, naming the parser message_cleaning_parser, selecting the built-in regex parser, and applying the regular expression shown here to convert each message into a structured format (note this is actually applied to incoming messages in the next phase of the telemetry pipeline):

# This parser uses the built-in parser plugin and applies the
# regex to all incoming events.
#
[PARSER]
    Name message_cleaning_parser
    Format regex
    Regex ^(?<valid_message>[^ ]+) (?<code>[^ ]+) (?<type>[^ ]+)$

Next up is the Filter phase, where the previously defined parser is put to the test. To set up the filter configuration, we create a new file called filters.conf in our favorite editor. Add the following configuration, where we are defining a FILTER that uses the parser filter plugin, matching all incoming messages to apply this filter, looking for the key message to select the value to be fed into the parser, and applying the parser message_cleaning_parser to it:

# This filter is applied to all events and uses the named parser to
# apply values found with the chosen key if it exists.
#
[FILTER]
    Name parser
    Match *
    Key_Name message
    Parser message_cleaning_parser

To make sure the new filter and parser are included, we update our main configuration file workshop-fb.conf as follows:

# Fluent Bit main configuration file.

[SERVICE]
    parsers_file parsers.conf

# Imports section.
@INCLUDE inputs.conf
@INCLUDE outputs.conf
@INCLUDE filters.conf

To verify that our configuration works, we can test run it with our Fluent Bit installation. Depending on the chosen install method, here we show how to run it using the source installation, followed by the container version. Below, the source install is shown from the directory we created to hold all our configuration files:

# source install.
#
$ [PATH_TO]/fluent-bit --config=workshop-fb.conf

The console output should look something like this, noting that we've cut out the ASCII logo at startup:

...
[2024/04/09 16:19:42] [ info] [input:dummy:dummy.0] initializing
[2024/04/09 16:19:42] [ info] [input:dummy:dummy.0] storage_strategy='memory' (memory only)
[2024/04/09 16:19:42] [ info] [input:dummy:dummy.1] initializing
[2024/04/09 16:19:42] [ info] [input:dummy:dummy.1] storage_strategy='memory' (memory only)
[2024/04/09 16:19:42] [ info] [output:stdout:stdout.0] worker #0 started
[2024/04/09 16:19:42] [ info] [sp] stream processor started
[0] event.success: [[1712672383.962198000, {}], {"valid_message"=>"true", "code"=>"200", "type"=>"success"}]
[0] event.error: [[1712672383.964528000, {}], {"valid_message"=>"false", "code"=>"500", "type"=>"error"}]
[0] event.success: [[1712672384.961942000, {}], {"valid_message"=>"true", "code"=>"200", "type"=>"success"}]
[0] event.error: [[1712672384.962105000, {}], {"valid_message"=>"false", "code"=>"500", "type"=>"error"}]
...

Note the alternating generated event lines with parsed messages that now contain keys to simplify later querying. This runs until you exit with CTRL-C. Let's now try testing our configuration by running it using a container image. The first thing we need is to open the file Buildfile in our favorite editor. This is going to be expanded to include the filters and parsers configuration files.
Note that this file needs to be in the same directory as our configuration files; otherwise, adjust the file path names:

FROM cr.fluentbit.io/fluent/fluent-bit:3.0.1

COPY ./workshop-fb.conf /fluent-bit/etc/fluent-bit.conf
COPY ./inputs.conf /fluent-bit/etc/inputs.conf
COPY ./outputs.conf /fluent-bit/etc/outputs.conf
COPY ./filters.conf /fluent-bit/etc/filters.conf
COPY ./parsers.conf /fluent-bit/etc/parsers.conf

Now we'll build a new container image, naming it with a version tag, as follows using the Buildfile and assuming you are in the same directory (using Podman as discussed in previous articles):

$ podman build -t workshop-fb:v4 -f Buildfile

STEP 1/6: FROM cr.fluentbit.io/fluent/fluent-bit:3.0.1
STEP 2/6: COPY ./workshop-fb.conf /fluent-bit/etc/fluent-bit.conf
--> 7eee3091e091
STEP 3/6: COPY ./inputs.conf /fluent-bit/etc/inputs.conf
--> 53ff32210b0e
STEP 4/6: COPY ./outputs.conf /fluent-bit/etc/outputs.conf
--> 62168aa0c600
STEP 5/6: COPY ./filters.conf /fluent-bit/etc/filters.conf
--> 08f0878ded1e
STEP 6/6: COPY ./parsers.conf /fluent-bit/etc/parsers.conf
COMMIT workshop-fb:v4
--> 92825169e230
Successfully tagged localhost/workshop-fb:v4
92825169e230a0cc36764d6190ee67319b6f4dfc56d2954d267dc89dab8939bd

Now to run our new container image:

$ podman run workshop-fb:v4

The output looks exactly like the source output above; note that the alternating generated event lines with parsed messages now contain keys to simplify later querying. This completes our use case for this article; be sure to explore this hands-on experience with the accompanying workshop lab. What's Next? This article walked us through a telemetry pipeline use case for multiple events using parsing and filtering. The series continues with the next step, where we'll explore how to collect metrics using a telemetry pipeline. Stay tuned for more hands-on material to help you with your cloud-native observability journey.
Businesses can react quickly and effectively to user behavior patterns by using real-time analytics. This allows them to take advantage of opportunities that might otherwise pass them by and prevent problems from getting worse. Apache Kafka, a popular event streaming platform, can be used for real-time ingestion of data/events generated from various sources across multiple verticals such as IoT, financial transactions, and inventory. This data can then be streamed into multiple downstream applications or engines for further processing and eventual analysis to support decision-making. Apache Flink serves as a powerful engine for refining or enhancing streaming data by modifying, enriching, or restructuring it upon arrival at the Kafka topic. In essence, Flink acts as a downstream application that continuously consumes data streams from Kafka topics for processing and then ingests the processed data into various Kafka topics. Eventually, Apache Druid can be integrated to consume the processed streaming data from Kafka topics for analysis, querying, and making instantaneous business decisions. In my previous write-up, I explained how to integrate Flink 1.18 with Kafka 3.7.0. In this article, I will outline the steps to transfer processed data from Flink 1.18.1 to a Kafka 2.13-3.7.0 topic. A separate article detailing the ingestion of streaming data from Kafka topics into Apache Druid for analysis and querying was published a few months ago; you can read it here. Execution Environment We configured a multi-node cluster (three nodes) where each node has a minimum of 8 GB RAM and a 250 GB SSD, along with Ubuntu 22.04.2 amd64 as the operating system. OpenJDK 11 is installed, with the JAVA_HOME environment variable configured, on each node. Python 3 or Python 2 along with Perl 5 is available on each node. A three-node Apache Kafka 3.7.0 cluster has been up and running with Apache ZooKeeper 3.5.6 on two nodes. Apache Druid 29.0.0 has been installed and configured on the node in the cluster where ZooKeeper has not been installed for the Kafka broker; ZooKeeper has been installed and configured on the other two nodes. The leader broker is up and running on the node where Druid is running. We developed a simulator using the Datafaker library to produce fake real-time financial transaction JSON records every 10 seconds and publish them to the created Kafka topic. Here is the JSON data feed generated by the simulator:

JSON
{"timestamp":"2024-03-14T04:31:09Z ","upiID":"9972342663@ybl","name":"Kiran Marar","note":" ","amount":"14582.00","currency":"INR","geoLocation":"Latitude: 54.1841745 Longitude: 13.1060775","deviceOS":"IOS","targetApp":"PhonePe","merchantTransactionId":"ebd03de9176201455419cce11bbfed157a","merchantUserId":"65107454076524@ybl"}

The Apache Flink-1.18.1-bin-scala_2.12.tgz archive was extracted on the node where neither Druid nor the Kafka leader broker is running. Running a Streaming Job in Flink We will dig into the process of extracting data from a Kafka topic, where incoming messages are being published from the simulator, performing processing tasks on the data, and then reintegrating the processed data back into a different topic of the multi-node Kafka cluster. We developed a Java program (StreamingToFlinkJob.java) that was submitted as a job to Flink to perform the above-mentioned steps, considering a window of 2 minutes and calculating the average amount transacted from the same mobile number (UPI ID) on the simulated UPI transactional data stream.
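Before diving into the code, note that the job needs the Flink streaming API and the Kafka connector on the classpath. As a rough sketch only (the artifact versions shown are assumptions based on the Flink 1.18.1 release, not the author's exact list), a Maven setup would look something like this:

<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-java</artifactId>
    <version>1.18.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-clients</artifactId>
    <version>1.18.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka</artifactId>
    <version>3.1.0-1.18</version>
</dependency>
<dependency>
    <groupId>com.fasterxml.jackson.core</groupId>
    <artifactId>jackson-databind</artifactId>
    <version>2.15.2</version>
</dependency>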
These libraries, or the corresponding jar files, have been included in the project build path or classpath. Using the code below, we can get the Flink execution environment inside the developed Java class:

Java
Configuration conf = new Configuration();
StreamExecutionEnvironment env = StreamExecutionEnvironment.createLocalEnvironmentWithWebUI(conf);

Now we should read the messages/stream that has already been published by the simulator to the Kafka topic inside the Java program. Here is the code block:

Java
KafkaSource<UPITransaction> kafkaSource = KafkaSource.<UPITransaction>builder()
    .setBootstrapServers(IKafkaConstants.KAFKA_BROKERS) // IP address with port 9092 where the leader broker is running in the cluster
    .setTopics(IKafkaConstants.INPUT_UPITransaction_TOPIC_NAME)
    .setGroupId("upigroup")
    .setStartingOffsets(OffsetsInitializer.latest())
    .setValueOnlyDeserializer(new KafkaUPISchema())
    .build();

To retrieve information from Kafka, setting up a deserialization schema within Flink is crucial for processing events in JSON format, converting raw data into a structured form. Importantly, setParallelism needs to be set to the number of Kafka topic partitions; otherwise, the watermark won't work for the source, and data is not released to the sink.

Java
DataStream<UPITransaction> stream = env.fromSource(kafkaSource, WatermarkStrategy.forBoundedOutOfOrderness(Duration.ofMinutes(2)), "Kafka Source").setParallelism(1);

With successful event retrieval from Kafka, we can enhance the streaming job by incorporating processing steps. The subsequent code snippet reads Kafka data, organizes it by mobile number (upiID), and computes the average amount per mobile number. To accomplish this, we developed a custom window function for calculating the average and implemented watermarking to handle event time semantics effectively. Here is the code snippet:

Java
SerializableTimestampAssigner<UPITransaction> sz = new SerializableTimestampAssigner<UPITransaction>() {
    @Override
    public long extractTimestamp(UPITransaction transaction, long l) {
        try {
            SimpleDateFormat sdf = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss'Z'");
            Date date = sdf.parse(transaction.eventTime);
            return date.getTime();
        } catch (Exception e) {
            return 0;
        }
    }
};

WatermarkStrategy<UPITransaction> watermarkStrategy = WatermarkStrategy.<UPITransaction>forBoundedOutOfOrderness(Duration.ofMillis(100)).withTimestampAssigner(sz);
DataStream<UPITransaction> watermarkDataStream = stream.assignTimestampsAndWatermarks(watermarkStrategy);

// Instead of event time, we can use a window based on processing time (TumblingProcessingTimeWindows).
DataStream<TransactionAgg> groupedData = watermarkDataStream.keyBy("upiID")
    .window(TumblingEventTimeWindows.of(Time.milliseconds(2500), Time.milliseconds(500)))
    .apply(new TransactionAgg());

Eventually, the processing logic (computing the average amount for the same UPI ID, i.e., mobile number, over the window on the continuous flow of the transaction stream) is executed inside Flink. Here is the code block for the window function that calculates the average amount for each UPI ID or mobile number:
Java
public class TransactionAgg implements WindowFunction<UPITransaction, TransactionAgg, Tuple, TimeWindow> {
    @Override
    public void apply(Tuple key, TimeWindow window, Iterable<UPITransaction> values, Collector<TransactionAgg> out) {
        Integer sum = 0; // consider whole numbers only
        int count = 0;
        String upiID = null;
        for (UPITransaction value : values) {
            sum += value.amount;
            upiID = value.upiID;
            count++;
        }
        TransactionAgg output = new TransactionAgg();
        output.upiID = upiID;
        output.eventTime = window.getEnd();
        output.avgAmount = (sum / count); // integer division; use a double if fractional averages are needed
        out.collect(output);
    }
}

We have processed the data. The next step is to serialize the object and send it to a different Kafka topic. We add a KafkaSink in the developed Java code (StreamingToFlinkJob.java) to send the processed data from the Flink engine to a different Kafka topic created on the multi-node Kafka cluster. Here is the code snippet to serialize the object before sending/publishing it to the Kafka topic:

Java
public class KafkaTrasactionSinkSchema implements KafkaRecordSerializationSchema<TransactionAgg> {
    // The target topic and the Jackson objectMapper are fields initialized in the constructor (not shown).
    @Override
    public ProducerRecord<byte[], byte[]> serialize(TransactionAgg aggTransaction, KafkaSinkContext context, Long timestamp) {
        try {
            return new ProducerRecord<>(
                topic,
                null, // partition not specified, so setting null
                aggTransaction.eventTime,
                aggTransaction.upiID.getBytes(),
                objectMapper.writeValueAsBytes(aggTransaction));
        } catch (Exception e) {
            throw new IllegalArgumentException("Exception on serialize record: " + aggTransaction, e);
        }
    }
}

And below is the code block to sink the processed data, sending it back to a different Kafka topic:

Java
KafkaSink<TransactionAgg> sink = KafkaSink.<TransactionAgg>builder()
    .setBootstrapServers(IKafkaConstants.KAFKA_BROKERS)
    .setRecordSerializer(new KafkaTrasactionSinkSchema(IKafkaConstants.OUTPUT_UPITRANSACTION_TOPIC_NAME))
    .setDeliveryGuarantee(DeliveryGuarantee.AT_LEAST_ONCE)
    .build();
groupedData.sinkTo(sink); // the DataStream of TransactionAgg created above
env.execute();

Connecting Druid With the Kafka Topic In this final step, we need to integrate Druid with the Kafka topic to consume the processed data stream that is continuously published by Flink. With Apache Druid, we can connect directly to Apache Kafka so that real-time data can be ingested continuously and subsequently queried to make business decisions on the spot, without involving any third-party system or application. Another beauty of Apache Druid is that we need not configure or install any third-party UI application to view the data that lands on or is published to the Kafka topic. To condense this article, I omitted the steps for integrating Druid with Apache Kafka. However, a few months ago, I published an article on this topic (linked earlier in this article); you can read it and follow the same approach. Final Note The code snippets provided above are for understanding purposes only. They illustrate the sequential steps of consuming messages/data streams from a Kafka topic, processing the consumed data, and eventually sending/pushing the modified data into a different Kafka topic. This allows Druid to pick up the modified data stream for querying and analysis as a final step. Later, we will upload the entire codebase on GitHub if you are interested in executing it on your own infrastructure. I hope you enjoyed reading this. If you found this article valuable, please consider liking and sharing it.