hexo

Terraform-AWS serverless blog

Iā€™m learning Terraform at the moment and thought this could be a good hand-on side project for me. The provided terraform code will spin up a github repo, a codebuild project and a s3 bucket to host a static blog (blue box in the flow chart above). I figure people might not want to use cloudfront or route 53 as they are not free tier service, so I left them out.

To spin this up, we will need the below prerequisites:

Once all the prerequisites are setup, follow the steps below.

  1. Open cmd/powershell and run the following commands to clone terraform and build spec file:
1
git clone https://github.com/tduong10101/serverless-blog-terra.git
  1. Update serverless-blog-terra/variable.tfvars with your github token and site name that you would like set up
  2. Run the following commands
1
2
3
cd serverless-blog-terra
terraform init
terraform apply -var-file variable.tfvars
  1. Review the resouces and put in ā€œyesā€ to approve terraform to spin them up.
  2. Grab the outputs and save them somewhere, weā€™ll use them for later steps.
  3. Navigate to the parent folder of serverless-blog-terra
1
cd ..
  1. Create a new folder, give it the same name as git repo (doesnā€™t matter if the is not the same, itā€™s just easier to manage), cd to new folder and run hexo init command

    1
    2
    3
    mkdir <new folder>
    cd .\<new folder>
    hexo init
  2. Copy buildspec.yml file from serverless-blog-terra folder to this new folder

  3. Update the buildspec.yml with s3:// link from step 5

  4. Init Git and setup git remote with the below commands. Insert your git repo url from step 5.

1
2
3
4
5
git init
git add *
git commit -m "init"
git remote add origin "<your-git-url-from-step-5>"
git push -u origin master
  1. Wait for codebuild to complete update S3 bucket. Logon to AWS console to confirm.
  1. Open the website_endpoint url on step 5 and enjoy your serverless blog.

Visit Hexo for instructions on how to create posts, change theme, add plugins etc

Remove the blog:

  1. If you donā€™t like the new blog and want to clean up aws/git resources. Run the below command:
1
terraform destroy -var-file variable.tfvars
  1. Once terraform finish cleaning up the resources. The rest of the folders can be removed from local computer.

My random podcast app

Iā€™ve been trying to catch up with a few podcasts and canā€™t decide what to listen to first. So I thought, let create an app that could pick out a random episode for me. Less thinking about picking and more time listening!

So here is what I came up with.

I thought it would be straight forward but it took me the whole weekend to get it up T__T

There are 4 lambda functions in this app.

1- update-station: trigger whenever a new item is added to stationsDB. This will crawl the site main page to get episode playlist and insert that back to stationsDB as list_url

2- update-episode: trigger by update-station function or a monthly cloudwatch event. This function will loop through the stationsdb and run the itemā€™s spider fucntion on its list_url. The output would be a list of 50 most recent episodes for each stations. This list would then get compare with all episodes added to episodesDB. The differences would then get added to episodesDB

3- gen-random-episode: trigger by api gateway when an episode is finished playing at https://blog.tdinvoke.net/random-podcast/. This funciton would first change the current episode status ā€˜completedā€™. Then it would pull out all episodes url from episodeDB that havenā€™t play (with blank status). Random pick out 1 episode and change its status to current.

4- get-current-episode: trigger by api gateway when the page https://blog.tdinvoke.net/random-podcast/ is loaded. This one is simple, pull episode with ā€˜currentā€™ status.

You can find the codes here

To see the app in action, please visit here

Issues encountered/thoughts:

  • Add a UI page to modify the station DB. Iā€™ll have to workout how to put authorisation in API call to add new station.
  • Split crawler functions into separate lambda functions which make the functions clean and easy to manage.
  • Add more crawler.At the moment, this app only crawl playerfm stations.
  • Learnt how to add js scripts to Hexo. There arnā€™t much information on how to it out there. I had to hack around for awhile. Basically, I need to create a new script folder at thems/ā€˜my-themeā€™/source/ā€˜td-podcastā€™. Chuck all my js scripts in there, then modify ā€˜_partials/scripts.ejsā€™ to reference the source folder. Learnt a bit of ejs as well.
  • Chalice doesnā€™t have Dynamodb stream trigger, gave up halfway and gone back to create the lambda functions manually.
  • Looking into SAM and CloudFormation to do CI/CD on this.
  • Could turn this into youtube/twitch random video. Looking into Youtube Google api and Twitch api.

Serverless Blog

So this blog is serverless using combination of hexo, s3, github, codebuild, route53 and cloudfront. My original plan was to build the blog from the ground up with lambda chalice, dynamodb and some hacking with java script. But I thought there got to be someone with the same idea somewhere. One search on google and found two wonderful guides from hackernoon and greengocloud. Thanks to the guides I was able to spin this up within 4-5 hours. Iā€™m still getting use to Hexo and markdown but feeling pretty good that I got it working.

I was struggling a bit with git, the theme didnā€™t get committed properly. Removed Git Submodule sorted the issue out.

Also CodeBuild didnā€™t play nice with default role, got to give the role fullS3access to the bucket. Itā€™s working like charm now.

PS: This blog use Chan theme by denjones