Automating data workflows using Amazon Textract and Amazon Comprehend Medical deployed through Terraform

Table Of Contents

Overview

Automating data workflows using Amazon Textract and Amazon Comprehend Medical deployed through Terraform

Contents

Objective

Upload .png, .jpg, .pdf data to a S3 source bucket. Process sample medical data using Amazon Textract, Amazon Comprehend and store it in S3. Unlock the ability to query data using Amazon Athena and visualize using QuickSight.

Source Code

The GitHub repository is available here.

Process

To run a validation process using a valid claim form

  1. The input directory is automatically created for you, along with uploading the test file.
  2. Examine the CSV file in the S3 bucket’s “result” prefix. This CSV file should contain the fields extracted from the PNG file using Amazon Textract.
  3. Examine the CSV file in the S3 bucket’s “procedureresult” prefix. This CSV file should contain the fields extracted from “PROCEDURE” key from the sample template using Amazon Comprehend Medical

To run a validation process using an invalid claim form

For claims that fail validation, an email notification is sent to the user notifying them to fix the errors. To replicate this use case, use the sample PNG form as an example, which has an invalid Claim ID.

  1. Download and save the image as .png file and run the steps 1 and 2 mentioned above.
  2. Check your email for message from Amazon SNS. This time, instead of a CSV file in the “result” bucket, you should see an email notification instead.
  3. Optionally, Check the AWS Lambda execution logs in Amazon CloudWatch

Analyzing and Visualizing Claim data

To execute the analytics from claim data using Amazon Athena

  1. Run the query using Amazon Athena Query Editor

To setup a visualization for the claim procedure field

  1. Follow these steps to create data set using Amazon Athena data
  2. Define the visualization by selecting the parameters on the left.

To setup a visualization for the claim data

  1. Follow these steps to create data set using Amazon S3 for claim document entities
  2. Define visualization by selecting the parameters on the left

Diagram

Diagram

Requirements

  1. Terraform v.12 or later
  2. AWS Account
  3. AWS IAM credentials (set in the ~/.aws/credentials file) to deploy the required resources

Variables

variable  "aws_region"  {

type =  string

default =  "us-east-1"

description =  "AWS Region to deploy to"

}

  

variable  "aws_profile"  {

type =  string

description =  "AWS Profile to use credentials to deploy"

}

  

variable  "email"  {

type =  string

description =  "Email address used for notifications"

}

  

variable  "policy-attach"  {

default =  {

"arn:aws:iam::aws:policy/AmazonS3FullAccess" = 1,

"arn:aws:iam::aws:policy/AmazonSQSFullAccess" = 2,

"arn:aws:iam::aws:policy/AmazonSNSFullAccess" = 3,

"arn:aws:iam::aws:policy/AWSLambda_FullAccess" = 4,

"arn:aws:iam::aws:policy/AmazonTextractFullAccess" = 5,

"arn:aws:iam::aws:policy/ComprehendMedicalFullAccess" = 6,

"arn:aws:iam::aws:policy/CloudWatchFullAccess" = 7,

"arn:aws:iam::aws:policy/service-role/AWSLambdaBasicExecutionRole" = 8

}

}

Demo

Diagram

Share :

Related Posts

AWS Security: a curated list of repositories

AWS Security: a curated list of repositories

AWS Security: a curated list of repositories Overview These lists will curate some of the latest GitHub repositories that assist with deploying to AWS using Terraform.

Read More
AWS re:Invent 2020 Andy Jassy Keynote annoucements!

AWS re:Invent 2020 Andy Jassy Keynote annoucements!

AWS re:Invent 2020 - Andy Jassy Keynote Here’s a recap of what was announced in Andy Jassy’s 2020 AWS re:Invent Keynote; a lot of very exciting announcements to unpack here!

Read More
Conjure CloudWatch metrics in the CLI!

Conjure CloudWatch metrics in the CLI!

Visualizing CloudWatch Metrics in the command line using Node.js! Intro Capabilities AWS Lambda Amazon DynamoDB Amazon API Gateway Requirements Now, let’s see it in action! Setup & Installation First walkthrough Using the Linux watch command paired with awscii Intro awscii allows for the ability to stay in the command line when troubleshooting or the desire to retrive AWS CloudWatch Metrics. This open-source tool displays your accounts CloudWatch metrics in an easy to digest, updatable way!

Read More