This is the Scala implementation of referer-parser, the library for extracting attribution data from referer (sic) URLs.
The implementation uses a JSON version of the shared 'database' of known referers found in referers.yml
.
The Scala implementation is a core component of Snowplow, the open-source web-scale analytics platform.
You can add the following to your SBT config:
val refererParser = "com.snowplowanalytics" %% "scala-referer-parser" % "2.0.0"
You can provide wrappers for effects, such as Sync
, Eval
or Id
from cats-effect. In the examples below we use IO
.
import com.snowplowanalytics.refererparser._
import cats.effect.IO
import cats.data.EitherT
import java.net.URI
val refererUrl = "http://www.google.com/search?q=gateway+oracle+cards+denise+linn&hl=en&client=safari"
val pageUrl = "http:/www.psychicbazaar.com/shop" // Our current URL
val referersJsonPath = "/opt/referers/referers.json"
// We use EitherT to handle exceptions. The IO routine will short circuit if an exception is returned.
val io: EitherT[IO, Exception, Unit] = for {
// We can instantiate a new Parser instance with Parser.create
parser <- EitherT(CreateParser[IO].create(referersJsonPath))
// Referer is a sealed hierarchy of different referer types
referer1 <- EitherT.fromOption[IO](parser.parse(refererUrl, pageUrl),
new Exception("No parseable referer"))
_ <- EitherT.right(IO { println(referer1) })
// => SearchReferer(SearchMedium,Google,Some(gateway oracle cards denise linn))
// You can provide a list of domains which should be considered internal
referer2 <- EitherT.fromOption[IO](parser.parse(
new URI("http://www.subdomain1.snowplowanalytics.com"),
Some("http://www.snowplowanalytics.com"),
List("www.subdomain1.snowplowanalytics.com", "www.subdomain2.snowplowanalytics.com")
), new Exception("No parseable referer"))
_ <- EitherT.right(IO { println(referer2) })
// => InternalReferer(InternalMedium)
// Various overloads are available for common cases, for instance
maybeReferer1 = parser.parse("https://www.bing.com/search?q=snowplow")
maybeReferer2 = parser.parse(new URI("https://www.bing.com/search?q=snowplow"), None, Nil)
_ <- EitherT.right(IO { println( maybeReferer1 == maybeReferer2 ) }) // => true
} yield Unit
io.value.unsafeRunSync()
More examples can be seen in ParseTest.scala. See Parser.scala for all overloads.
Check out our contributing guide.
The referer-parser Java/Scala library is copyright 2012-2022 Snowplow Analytics Ltd.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License.
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.