package Plucene::Index::DocumentWriter;

=head1 NAME

Plucene::Index::DocumentWriter - the document writer

=head1 SYNOPSIS

	my $writer = Plucene::Index::DocumentWriter
		->new($directory, $analyser, $max_field_length);

	$writer->add_document($segment, $doc);

=head1 DESCRIPTION

This is the document writer class.

=head2 METHODS

=cut

use strict;
use warnings;

use File::Slurp;
use IO::Scalar;

use Plucene::Index::FieldInfos;
use Plucene::Index::FieldsWriter;
use Plucene::Index::Term;
use Plucene::Index::TermInfo;
use Plucene::Index::TermInfosWriter;
use Plucene::Search::Similarity;
use Plucene::Store::OutputStream;

=head2 new

	my $writer = Plucene::Index::DocumentWriter
		->new($directory, $analyser, $max_field_length);

This will create a new Plucene::Index::DocumentWriter object with the
passed in arguments.

=cut

sub new {
	my ($self, $d, $a, $mfl) = @_;
	bless {
		directory        => $d,
		analyzer         => $a,
		max_field_length => $mfl,
		postings         => {},
	}, $self;
}

=head2 add_document

	$writer->add_document($segment, $doc);

Index a single document into the given segment: write its field infos
(.fnm), its stored fields, its postings (.frq/.prx via the term infos
writer) and its per-field norms.

=cut

sub add_document {
	my ($self, $segment, $doc) = @_;

	# Record and persist the field infos for this document.
	my $fi = Plucene::Index::FieldInfos->new();
	$fi->add($doc);
	$fi->write("$self->{directory}/$segment.fnm");
	$self->{field_infos} = $fi;

	# Store the document's stored fields.
	my $fw =
		Plucene::Index::FieldsWriter->new($self->{directory}, $segment, $fi);
	$fw->add_document($doc);

	# Invert the document into postings, then flush them sorted by
	# (field, text) as the term dictionary format requires.
	$self->{postings}      = {};
	$self->{field_lengths} = [];
	$self->_invert_document($doc);
	my @postings = sort {
		     $a->{term}->{field} cmp $b->{term}->{field}
			|| $a->{term}->{text} cmp $b->{term}->{text}
	} values %{ $self->{postings} };
	$self->_write_postings($segment, @postings);

	$self->_write_norms($doc, $segment);
}

# Tokenize (or take verbatim) every indexed field of the document and
# accumulate term positions into $self->{postings}.
sub _invert_document {
	my ($self, $doc) = @_;
	for my $field (grep $_->is_indexed, $doc->fields) {
		my $name = $field->name;
		my $fn   = $self->{field_infos}->field_number($name);

		# Position counter continues across multiple fields with the
		# same name; defaults to 0 the first time the field is seen
		# (field_lengths is reset per document in add_document).
		my $pos = $self->{field_lengths}->[$fn] || 0;

		if (!$field->is_tokenized) {    # untokenized: one term, whole value
			$self->_add_position($name, $field->string, $pos++);
		} else {
			my $reader = $field->reader
				|| IO::Scalar->new(\$field->{string});
			my $stream = $self->{analyzer}->tokenstream({
				field  => $name,
				reader => $reader,
			});
			while (my $t = $stream->next) {
				$self->_add_position($name, $t->text, $pos++);

				# Honour the configured cap on indexed tokens per field.
				last if $pos > $self->{max_field_length};
			}
		}
		$self->{field_lengths}->[$fn] = $pos;
	}
}

# Record one occurrence of $text in $field at token position $pos,
# creating the posting on first sight and appending otherwise.
sub _add_position {
	my ($self, $field, $text, $pos) = @_;

	# Postings are keyed by "field\0text" so distinct fields with the
	# same term text stay separate.
	my $ti = $self->{postings}->{"$field\0$text"};
	if ($ti) {
		$ti->{positions}->[ $ti->freq ] = $pos;
		$ti->{freq}++;
		return;
	}
	$self->{postings}->{"$field\0$text"} = Plucene::Index::Posting->new({
		term      => Plucene::Index::Term->new({ field => $field, text => $text }),
		positions => [$pos],
		freq      => 1,
	});
}

# Emit the sorted postings: the term dictionary via TermInfosWriter, the
# frequency data to .frq and the position deltas to .prx, both as BER
# compressed integers (pack 'w').
sub _write_postings {
	my ($self, $segment, @postings) = @_;
	my (@freqs, @proxs);
	my $tis =
		Plucene::Index::TermInfosWriter->new($self->{directory}, $segment,
		$self->{field_infos});
	my $ti = Plucene::Index::TermInfo->new();

	for my $posting (@postings) {
		# Single-document segment, so doc_freq is always 1 and the
		# freq/prox pointers are just the current stream offsets.
		$ti->doc_freq(1);
		$ti->freq_pointer(scalar @freqs);
		$ti->prox_pointer(scalar @proxs);
		$tis->add($posting->term, $ti);

		# Lucene encoding: a freq of 1 is packed into a single value
		# with the low bit set; otherwise emit (0, freq).
		my $f = $posting->freq;
		push @freqs, ($f == 1) ? 1 : (0, $f);

		# Positions are stored as deltas from the previous position.
		my $last_pos  = 0;
		my $positions = $posting->positions;
		for my $j (0 .. $f - 1) {
			my $pos = $positions->[$j] || 0;
			push @proxs, $pos - $last_pos;
			$last_pos = $pos;
		}
	}

	write_file("$self->{directory}/$segment.frq" => pack('(w)*', @freqs));
	write_file("$self->{directory}/$segment.prx" => pack('(w)*', @proxs));
	$tis->break_ref;
}

# Write one norm byte per indexed field (file .f<field-number>) derived
# from the field's token count via the Similarity normalisation.
sub _write_norms {
	my ($self, $doc, $segment) = @_;
	for my $field (grep $_->is_indexed, $doc->fields) {
		my $fn = $self->{field_infos}->field_number($field->name);
		warn "Couldn't find field @{[ $field->name ]} in list [ @{[
			map $_->name, $self->{field_infos}->fields]}]"
			unless $fn >= 0;
		my $norm =
			Plucene::Store::OutputStream->new(
			"$self->{directory}/$segment.f$fn");
		my $val      = $self->{field_lengths}[$fn];
		my $norm_val = Plucene::Search::Similarity->norm($val);
		$norm->print(chr($norm_val));
	}
}

# Lightweight record used while inverting a document: a term, its
# within-document frequency, and the list of token positions.
package Plucene::Index::Posting;

use base 'Class::Accessor::Fast';
__PACKAGE__->mk_accessors(qw( term freq positions ));

1;