Need suggestions for Apache Beam learning materials for beginners - apache-beam

I need to learn Apache Beam for a project. I have gone through the Apache Beam documentation, but I don't think it is enough. Can someone recommend resources for learning Apache Beam?

Have you tried running through the Python/Java quick start tutorials?
You can also go through an explanation of the wordcount pipelines, so you become more familiar with some of the transforms available in Beam.
Once you have looked at the specific wordcount example, you can dive into the programming guide to get a more detailed understanding of the framework.
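To give a feel for what the wordcount walkthrough covers before you open it, here is a minimal sketch of the stages such a pipeline chains together: split lines into words, count each word, and format the result. This is plain Python and deliberately does not use the Beam API; the function names (split_words, word_count, format_counts) are made up for illustration, and in Beam each stage would roughly correspond to a transform applied to a collection.

```python
import re
from collections import Counter


def split_words(line):
    """Stage 1: break one line of text into lowercase words."""
    return re.findall(r"[a-z']+", line.lower())


def word_count(lines):
    """Stages 2 and 3: flatten the words of all lines, then count per word."""
    counts = Counter()
    for line in lines:
        counts.update(split_words(line))
    return dict(counts)


def format_counts(counts):
    """Stage 4: render each (word, count) pair as one output line."""
    return [f"{word}: {n}" for word, n in sorted(counts.items())]


if __name__ == "__main__":
    lines = ["to be or not to be", "that is the question"]
    for row in format_counts(word_count(lines)):
        print(row)
```

Keeping each stage as a small, separate function mirrors how the real example is built from composable steps, which is the main idea to take away before reading the programming guide.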

Related

How to test APIs with Scala (SDET Automation)?

I am new to Scala (I come from a Java background). I am an SDET. I have learned the syntax through courses on Pluralsight/Udemy, and now I want to learn how to test REST APIs using Scala.
I have read online about Akka-http-testkit, Play, and ScalaTest, and I have looked at some repos on Git, but honestly I have not found a tutorial that really explains things, and the documentation for these frameworks is not something I can wrap my head around. Then there are things about Actors having a service and a server, etc. ... I'm lost!
Is there any resource that gives a real step-by-step explanation and walkthrough? Many thanks.
Check this channel for details about Scala development and testing.
As for your question: here are video1, video2, and a code link showing how to use the ScalaTest framework.

Is using the Smile library over H2O a good idea in terms of performance?

I am looking for a walkthrough of Smile's performance compared to H2O.
Smile - Statistical Machine Intelligence and Learning Engine.
I want to use the Smile library to construct a pipeline using a word2vec model.
If you want to use the H2O word2vec algorithm, you can use the Sparkling Water samples here, which are written in Scala/Java and are very easy to adapt to your Scala code.
I don't have much experience with the SMILE library; however, looking at their code, you can find some Java samples in their test code here, in the nlp section. You can try using them from Scala as well.
I hope you know what you want to do, because the way you describe your question is not exactly clear. What you really need is to ask a very specific question after writing some code first. This will help you understand exactly where you are stuck and what is needed to unblock you.

Diagram tool for actors

Is there anything that can generate a sequence diagram between actors in Akka?
Or any kind of diagram in Akka?
I use sbt 0.13.13 with Scala 2.11.8 and Akka 2.4.16.
The idea of general visualization of Akka actors (mailboxes) is quite new, so there is not much in the way of existing frameworks for it. The one I came across that seemed closest to a general solution was akka-visualmailbox.
It also has a sample project and comprehensive installation instructions in the README.
The best part of this one is that it does not require code changes.
Another one that does is akka-message-visualization; however, I did not see any simple setup guides for it.
Use the Lightbend Telemetry visualization tools.
In particular, look into the Vizceral visualization tool.
I'm currently building one called Muse. It's still in its early stages, but I have already begun using it at the office. You can select one or more messages and generate a sequence diagram from them.

Is there any book or online tutorial for Scala DSL?

I read this one on Scala DSLs: http://camel.apache.org/scala-dsl-getting-started.html. It didn't help me much. I want more details. Can anyone help me out?
There is Debasish Ghosh's book DSLs in Action. It isn't focused entirely on Scala, but it has a good section on DSL development using Scala. Generally this would be enough for starter knowledge; for deeper experience I would look at the source code of either ScalaTest or Specs.
There is a lot of research ongoing in this area, especially regarding high-performing Scala DSLs. Look at:
Lightweight Modular Staging
Delite
I've published an article about building regular DSLs in Scala.
A short overview: most likely the DSL you're trying to build belongs to the class of regular languages. If so, you can model this DSL with a finite state machine. The article contains a step-by-step guide on how to transform that FSM into Scala code.
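To make the FSM idea concrete, here is a minimal, language-neutral sketch in Python. It is not the article's Scala code, and the tiny query-like DSL ("select <field> from <table>" with an optional "where <condition>") is invented purely for illustration. Each DSL keyword or identifier moves the machine from one state to the next, and a sentence is valid only if it ends in an accepting state.

```python
# Hypothetical DSL: select <field> from <table> [where <condition>]
KEYWORDS = {"select", "from", "where"}

# (current state, token kind) -> next state
TRANSITIONS = {
    ("start", "select"): "need_field",
    ("need_field", "IDENT"): "have_field",
    ("have_field", "from"): "need_table",
    ("need_table", "IDENT"): "have_table",
    ("have_table", "where"): "need_cond",
    ("need_cond", "IDENT"): "have_cond",
}

# States in which a sentence is allowed to end
ACCEPTING = {"have_table", "have_cond"}


def classify(token):
    """Map a raw token to its kind: a keyword stays as-is, anything else is IDENT."""
    return token if token in KEYWORDS else "IDENT"


def accepts(sentence):
    """Run the FSM over the whitespace-separated tokens of a sentence."""
    state = "start"
    for token in sentence.split():
        state = TRANSITIONS.get((state, classify(token)))
        if state is None:  # no transition defined: the sentence is invalid
            return False
    return state in ACCEPTING


if __name__ == "__main__":
    print(accepts("select name from users"))
    print(accepts("select from users"))
```

In a statically typed language such as Scala, the same table could plausibly be encoded so that each state becomes a type and each transition a method, letting the compiler reject invalid sentences; that is an assumption about how such a translation might look, not a summary of the article.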

Please suggest a direction for my small Scala project

As a six-month academic project in college, my three friends and I are going to implement "Distributed Caching" in Scala.
Since we are new to both of these concepts and this is our first project, I would be really happy if you could provide some direction.
I am currently learning Scala.
Please let me know which particular features of the language I should learn for this project.
Are there any online resources for learning distributed caching?
Thanks in advance.
You could have a look at Terracotta and especially at its uses in implementing Distributed Caching. You could have a look at the source code of the open source edition of Terracotta. Also, you could even consider Terracotta as your framework for building the distributed cache. I don't have any personal experience in using Terracotta with Scala, but it has been done.
Features of the language... Try starting with the Programming in Scala book. It's a very good resource. If you want to do any concurrency you will have to be proficient in using Actors. I would recommend having a look over all the features of Scala. Each one has its uses and you will need to know at least a bit of them to recognise situations in which to use their power. :)
-- Flaviu Cipcigan
You might want to look at the project Velocity page.
There is also an article on MSDN about distributed caching in general.
I'm not sure, but I think the Akka project might already be doing what you're looking for (and a whole lot more). Perhaps you can take inspiration from that.