• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
0

Remove duplicates

Enthusiast ,
Nov 02, 2017 Nov 02, 2017

Copy link to clipboard

Copied

Hello there,

I have a story in InDesign that is a list.

This list have hours and information.

I want to remove the duplicate hours, but keep the information together with the previous that has the same hour.

For example, I want to format this:

16 horas - Informação 1

16 horas - Informação 2

16 horas - Informação 3

16h45 - Informação 4

16h50 - Informação 5

17h30 - Informação 6

17h30 - Informação 7

As this:

16 horas - Informação 1

Informação 2

Informação 3

16h45 - Informação 4

16h50 - Informação 5

17h30 - Informação 6

Informação 7

Anyone of you have a great idea to make this using javascript and GREP find/change?

Thanks!

TOPICS
Scripting

Views

3.1K

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Guide , Nov 03, 2017 Nov 03, 2017

Hi If.corullon,

Just to add my two pennies, there is an interesting way of removing duplicated patterns at the beginning of paragraphs. The word ‘interesting’ does not pretend to mean ‘efficient’ but I think we may learn something from this 😉

Disregarding the specific \d+ horas and/or \d+h\d+ patterns involved in your example, I will focus on a more generic scheme for my demonstration. Let's just target strings like [^-]+-\s, that is, “anything before a hyphen, then the hyphen, then any space cha

...

Votes

Translate

Translate
Engaged ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Didn't have much time to do much.

It only works properly for this scenario:

16 horas - Informação 1

16 horas - Informação 2

16 horas - Informação 3

Maybe someone else can improve it for you

var story = (app.selection[0] instanceof TextFrame && app.selection[0] ) || exit();

var grep = "\\S+(\\s\\S+)?\\s-\\s\\S+\\s\\d";

app.findGrepPreferences = app.changeGrepPreferences = null;

app.findGrepPreferences.findWhat = grep;

var found = story.findGrep();
var first = [];
var duplicate;
var duplicates = [];

    for ( var i = 0; i < found.length; i++ ) {
       
        +found.contents.slice (-1) === 1 && first.push ( found );
        +found.contents.slice (-1) !== 1 && duplicates.push ( found );
    }

    if ( first.length !== 0 ) {

        while ( duplicate = first.pop() ) {

            for ( var i = 0; i < duplicates.length; i++ ) {

                        duplicate.contents.substr (0, 11) == duplicates.contents.substr (0, 11) && duplicates.contents = duplicates.contents.substr (11, duplicates.contents.length);

            }
        }         
    }

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Enthusiast ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

I spent some hours trying to deal with it yesterday.

The way I did it work is:

app.findGrepPreferences = app.changeGrepPreferences = app.findTextPreferences = app.changeTextPreferences = NothingEnum.nothing;

app.findGrepPreferences.findWhat = "(?i)(^\\d+(\\shoras|h)(\\d+)*\\r).+?\\r";

var found = mySel.findGrep();

app.findGrepPreferences = app.changeGrepPreferences = app.findTextPreferences = app.changeTextPreferences = NothingEnum.nothing;

for (var i=found.length-1; i>0; i--) {

     if (found.contents == found[i-1].contents) {

          found.contents = "";

     }

}

app.findGrepPreferences = app.changeGrepPreferences = app.findTextPreferences = app.changeTextPreferences = NothingEnum.nothing;

Thank you so much!

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Hi,

Maybe time for you to learn Grep! Take a look at this group: https://www.facebook.com/groups/TreasuresofGrep/

app.findGrepPreferences = null; 

app.findGrepPreferences.findWhat = "^\\d+(\\hhoras|(h\\d+))\\h-\\h"; 

var myFound = app.activeDocument.findGrep(),

F = myFound.length,  f;

for ( f = F-1; f > 0; f--) if ( myFound.contents == myFound[f-1].contents )  myFound.contents = "";

app.findGrepPreferences = null; 

(^/) 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Enthusiast ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

My own solution worked, 'dear' Obi-wan Kenobi.

The group you suggested me has a big ******* guy and I hate his guts.

By the way, nobody can stand the idiot but people just don’t say anything. So I did.

I already been in this group. I leave it because the stupid comments and actions of this guy.

Avoid him!

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Oh! I'm sad for you! … But your Grep code you give is truly not good and I'm very surprised you find something! …

(?i)(^\\d+(\\shoras|h)(\\d+)*\\r).+?\\r

(^/)

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Enthusiast ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Yes. It's working!

https://youtu.be/Nze2IwTSYps

By the way, let your friend from that group he is not the owner of the

truth.

Not always the things works only in the way he thinks.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Glad it works for you! … But not for me! … and your YouTube link too! … Apparently, not my day! 

(^/)

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guide ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Hi If.corullon,

Just to add my two pennies, there is an interesting way of removing duplicated patterns at the beginning of paragraphs. The word ‘interesting’ does not pretend to mean ‘efficient’ but I think we may learn something from this 😉

Disregarding the specific \d+ horas and/or \d+h\d+ patterns involved in your example, I will focus on a more generic scheme for my demonstration. Let's just target strings like [^-]+-\s, that is, “anything before a hyphen, then the hyphen, then any space character.” Our goal is to detect and remove such pattern when it is repeated at the beginning of successive paragraphs.

My first idea was to use something like ^([^-]+-\s)[^\r]+\r\K\1 , but this only selects the 2nd instance of the repeated pattern, so that wouldn't work at all for arbitrary number of dups. Now the funny trick is just a slight variation of the previous regex, /^([^-]+-\s)([^\r]+\r\K\1)+/ , which then magically selects the very last duplicate of the \1 capture!

From then we can design a recurring changeGrep() command that never needs to visit the found elements. Simply replace the captures by what you want and loop until changeGrep() returns an empty array. In the code below I use a tab ('\t') as changeTo parameter and my target is app.selection[0] since I have a TextFrame selected (of course you could use any other target, including app😞

const GREP = /^([^-]+-\s)([^\r]+\r\K\1)+/;

(function(target)

{

    app.findGrepPreferences = app.changeGrepPreferences = null;

   

    app.findGrepPreferences.findWhat = GREP.source;

    app.changeGrepPreferences.changeTo = '\t';

    while( target.changeGrep().length );

   

    app.findGrepPreferences = app.changeGrepPreferences = null;

   

})(app.selection[0]);

Here is how we can picture the iterative process (all is done in two steps):

Grep-Repeat.gif

Hope that helps,

@+

Marc

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Enthusiast ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

Absolutely awesome, Marc!

Thank you so much.

Of course it's a much more clever way to deal with it.

Thanks for share!

PS: the use of "const" instead of "var" is to avoid the reassignement of that value?

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guide ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

PS: the use of "const" instead of "var" is to avoid the reassignement of that value?

In some way, but actually there is no critical reason to use const in that context—in fact I often use the const keyword as a semantic marker. (It is useful in larger projects though, where debug mode and/or persistent engines are involved.)

@+

Marc

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Nov 03, 2017 Nov 03, 2017

Copy link to clipboard

Copied

LATEST

Nice function, Marc!

(^/) 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines